CDR-GRAFTED ANTI-TISSUE FACTOR
ANTIBODIES AND METHODS OF USE THEREOF

FIELD OF THE INVENTION 

Monoclonal antibodies capable of inhibiting tissue factor
(TF) are useful as anticoagulants. Conventional rodent
monoclonal antibodies, however, have limited use in human
therapeutic and diagnostic applications due to
immunogenicity and short serum half-life. The present
invention provides CDR-grafted monoclonal antibodies against
TF that retain the high binding affinity of rodent antibodies
but have reduced immunogenicity. The present humanized
antibodies are potent anticoagulants and are thus useful in
the treatment and prophylaxis of human thrombotic disease.
The invention also provides methods of making the
CDR-grafted antibodies and pharmaceutical compositions for
the attenuation or prevention of coagulation.

BACKGROUND OF THE INVENTION

The coagulation of blood involves a cascading series of
reactions leading to the formation of fibrin. The
coagulation cascade consists of two overlapping pathways,
both of which are required for hemostasis. The intrinsic
pathway comprises protein factors present in circulating
blood, while the extrinsic pathway requires tissue factor,
which is expressed on the cell surface of a variety of
tissues in response to vascular injury. Davie et al., 1991,
Biochemistry 30:10363. Agents that interfere with the
coagulation cascade, such as heparin and coumarin
derivatives, have well-known therapeutic uses in the
prophylaxis of venous thrombosis. Goodman and Gilman, eds.,
1980, The Pharmacological Basis of Therapeutics, MacMillan
Publishing Co., Inc., New York.

Tissue factor (TF) has been investigated as a target for
anticoagulant therapy. TF is a membrane glycoprotein that
functions as a receptor for factor VII and VIIa and thereby
initiates the extrinsic pathway of the coagulation cascade
in response to vascular injury. In addition to its role in
the maintenance of hemostasis by initiation of blood
clotting, TF has been implicated in pathogenic conditions.
Specifically, the synthesis and cell surface expression of
TF has been implicated in vascular disease (Wilcox et al.,
1989, Proc. Natl. Acad. Sci. 86:2839) and gram-negative
septic shock (Warr et al., 1990, Blood 75:1481).

Ruf et al. (1991, Thrombosis and Haemostasis 66:529)
characterized the anticoagulant potential of murine
monoclonal antibodies against human TF. The inhibition of
TF function by most of the monoclonal antibodies that were
assessed was dependent upon the dissociation of the TF/VIIa
complex that is rapidly formed when TF contacts plasma.
Such antibodies were thus relatively slow inhibitors of TF
in plasma. One monoclonal antibody, TF8-5G9, was capable of
inhibiting the TF/VIIa complex without dissociation of the
complex, thus providing an immediate anticoagulant effect in
plasma. Ruf et al. suggest that mechanisms that inactivate
the TF/VIIa complex, rather than prevent its formation, may
provide strategies for interruption of coagulation in vivo.

The therapeutic use of monoclonal antibodies against TF is
limited in that currently available monoclonals are of
rodent origin. The use of rodent antibodies in human therapy
presents numerous problems, the most significant of which is
immunogenicity. Repeated doses of rodent monoclonal
antibodies have been found to elicit an anti-immunoglobulin
response termed human anti-mouse antibody (HAMA), which can
result in immune complex disease and/or neutralization of
the therapeutic antibody. See, e.g., Jaffers et al. (1986)
Transplantation 41:572. While the use of human monoclonal
antibodies would address this limitation, it has proven
difficult to generate large amounts of human monoclonal
antibodies by conventional hybridoma technology.

Recombinant technology has been used in an effort to
construct "humanized" antibodies that maintain the high
binding affinity of rodent monoclonal antibodies but exhibit
reduced immunogenicity in humans. Chimeric antibodies have
been produced in which the variable (V) region of a mouse
antibody is combined with the constant (C) region of a human
antibody in an effort to maintain the specificity and
affinity of the rodent antibody but reduce the amount of
protein that is non-human and thus immunogenic. While the
immune response to chimeric antibodies is generally reduced
relative to the corresponding rodent antibody, the immune
response cannot be completely eliminated, because the mouse
V region is capable of eliciting an immune response.
Lobuglio et al. (1989) Proc. Natl. Acad. Sci. 86:4220;
Jaffers et al. (1986) Transplantation 41:572.

In a recent approach to reducing immunogenicity of rodent
antibodies, only the rodent complementarity determining
regions (CDRs), rather than the entire V domain, are
transplanted to a human antibody. Such humanized antibodies
are known as CDR-grafted antibodies. CDRs are regions of
hypervariability in the V regions that are flanked by
relatively conserved regions known as framework (FR)
regions. Each V domain contains three CDRs flanked by four
FRs. The CDRs fold to form the antigen binding site of the
antibody, while the FRs support the structural conformations
of the V domains. Thus by transplanting the rodent CDRs to a
human antibody, the antigen binding domain can theoretically
also be transferred. Owens et al. (1994) J. Immunol. Methods
168:149 and Winter et al. (1993) Immunology Today 14:243
review the development of CDR-grafted antibodies.

Orlandi et al. (1989) Proc. Natl. Acad. Sci. USA 86:3833
constructed a humanized antibody against the relatively
simple hapten nitrophenacetyl (NP). The CDR-grafted antibody
contained mouse CDRs and human FRs, and exhibited NP binding
activity similar to the native mouse antibody. However, the
construction of CDR-grafted antibodies recognizing more
complex antigens has resulted in antibodies having binding
activity significantly lower than the native rodent
antibodies. In numerous cases it has been demonstrated that
the mere introduction of rodent CDRs into a human antibody
background is insufficient to maintain full binding
activity, perhaps due to distortion of the CDR conformation
by the human FR.

For example, Gorman et al. (1991) Proc. Natl. Acad. Sci.
88:4181 compared two humanized antibodies against human CD4
and observed considerably different avidities depending upon
the particular human framework region of the humanized
antibody. Co et al. (1991) Proc. Natl. Acad. Sci. USA
88:2869 required a refined computer model of the murine
antibody of interest in order to identify critical amino
acids to be considered in the design of a humanized
antibody. Kettleborough et al. (1991) Protein Engineering
4:773 report the influence of particular FR residues of a
CDR-grafted antibody on antigen binding, and propose that
the residues may directly interact with antigen, or may
alter the conformation of the CDR loops. Similarly, Singer
et al. (1993) J. Immunol. 150:2844 report that optimal
humanization of an anti-CD18 murine monoclonal antibody is
dependent upon the ability of the selected FR to support the
CDR in the appropriate antigen binding conformation.
Accordingly, recreation of the antigen-binding site requires
consideration of the potential intrachain interactions
between the FR and CDR, and manipulation of amino acid
residues of the FR that maintain contacts with the loops
formed by the CDRs. While general theoretical guidelines
have been proposed for the design of humanized antibodies
(see, e.g., Owens et al.), in all cases the procedure must
be tailored and optimized for the particular rodent antibody
of interest.

There is a need in the art for humanized antibodies with
reduced immunogenicity and comparable binding affinity
relative to the parent rodent antibody for various
therapeutic applications. In particular, there is a need for
a humanized antibody against human tissue factor having
anticoagulant activity and useful in the treatment and
prevention of thrombotic disease.

SUMMARY OF THE INVENTION 

The present invention is directed to CDR-grafted antibodies
capable of inhibiting human tissue factor wherein the CDRs
are derived from a non-human monoclonal antibody against
tissue factor and the FR and constant (C) regions are
derived from one or more human antibodies. In a preferred
embodiment, the murine monoclonal antibody is TF8-5G9.

In another embodiment, the present invention provides a
method of producing a CDR-grafted antibody capable of
inhibiting human tissue factor which method comprises
constructing one or more expression vectors containing
nucleic acids encoding CDR-grafted antibody heavy and light
chains, transfecting suitable host cells with the expression
vector or vectors, culturing the transfected host cells, and
recovering the CDR-grafted antibody.

The present invention also provides a method of attenuation
of coagulation comprising administering a CDR-grafted
antibody capable of inhibiting human tissue factor to a
patient in need of such attenuation.

The present invention further provides a method of treatment
or prevention of thrombotic disease comprising administering
a CDR-grafted antibody capable of inhibiting human tissue
factor to a patient in need of such treatment or prevention.
In a preferred embodiment, the thrombotic disease is
intravascular coagulation, arterial restenosis or
arteriosclerosis.

Another embodiment of the present invention is directed to a
pharmaceutical composition comprising CDR-grafted antibodies
capable of inhibiting human tissue factor and further
comprising a pharmaceutically acceptable carrier.

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 provides the nucleotide and deduced amino acid
sequences of the heavy chain of murine monoclonal antibody
TF8-5G9.

Fig. 2 provides the nucleotide and deduced amino acid
sequences of the light chain of murine monoclonal antibody
TF8-5G9.

Fig. 3 is a graph depicting the ability of CDR-grafted
antibody TF8HCDR1 x TF8LCDR1 to bind to human tissue factor
and to compete with murine monoclonal antibody TF8-5G9 for
binding to tissue factor. Solid symbols indicate direct
binding of TF8HCDR1 x TF8LCDR1 and the positive control
chimeric TF8-5G9 to tissue factor. Open symbols indicate
competition binding of TF8HCDR1 x TF8LCDR1 or chimeric
TF8-5G9 with murine monoclonal antibody TF8-5G9.

Fig. 4 presents the DNA sequence of expression vector
pEe6TF8HCDR20 and the amino acid sequence of the coding
regions of the CDR-grafted heavy chain TF8HCDR20.

Fig. 5 presents the DNA sequence of expression vector
pEe12TF8LCDR3 and the amino acid sequence of the coding
regions of the CDR-grafted light chain TF8LCDR3.

Fig. 6 is a graph depicting the ability of CDR-grafted
antibody TF8HCDR20 x TF8LCDR3 to bind to human tissue
factor.

Fig. 7 is a graph depicting the ability of CDR-grafted
antibody TF8HCDR20 x TF8LCDR3 to compete with murine
monoclonal antibody TF8-5G9 for binding to tissue factor.

Fig. 8 is a graph depicting the ability of CDR-grafted
antibody TF8HCDR20 x TF8LCDR3 to inhibit factor X
activation.

Fig. 9 provides expression vector pEe6TF8HCDR20 resulting
from the subcloning of CDR-grafted heavy chain TF8HCDR20
into myeloma expression vector pEehCMV-Bgl I. The following
abbreviations are used: VH is the CDR-grafted heavy chain
variable region; Cγ4 is the human IgG4 constant region; pA
is the polyadenylation signal; ampR is the β-lactamase gene;
and hCMV is human cytomegalovirus.

Fig. 10 provides expression vector pEe12TF8LCDR3 resulting
from the subcloning of CDR-grafted light chain TF8LCDR3 into
myeloma expression vector pEe12. The following abbreviations
are used: VL is the CDR-grafted light chain variable region;
Cκ is the human kappa constant region; SVE is the SV40 early
promoter; GS is glutamine synthetase cDNA. Other
abbreviations are as noted in Fig. 9.

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides CDR-grafted antibodies
capable of inhibiting human tissue factor wherein the CDRs
are derived from a non-human monoclonal antibody against
tissue factor and the FR and C regions are derived from one
or more human antibodies. The present invention further
provides methods of making and using the subject CDR-grafted
antibodies.

In accordance with the present invention, the CDR-grafted
antibody is an antibody in which the CDRs are derived from a
non-human antibody capable of binding to and inhibiting the
function of human tissue factor, and the FR and C regions of
the antibody are derived from one or more human antibodies.
The CDRs derived from the non-human antibody preferably have
from about 90% to about 100% identity with the CDRs of the
non-human antibody, although any and all modifications,
including substitutions, insertions and deletions, are
contemplated so long as the CDR-grafted antibody maintains
the ability to bind to and inhibit tissue factor. The
regions of the CDR-grafted antibodies that are derived from
human antibodies need not have 100% identity with the human
antibodies. In a preferred embodiment, as many of the human
amino acid residues as possible are retained in order that
immunogenicity is negligible, but the human residues, in
particular residues of the FR region, are substituted as
required and as taught hereinbelow in accordance with the
present invention. Such modifications as disclosed herein
are necessary to support the antigen binding site formed by
the CDRs while simultaneously maximizing the humanization of
the antibody.

Non-human monoclonal antibodies against human tissue factor
from which the CDRs can be derived are known in the art (Ruf
et al., 1991; Morrisey et al., 1988, Thrombosis Research
52:247) or can be produced by well-known methods of
monoclonal antibody production (see, e.g., Harlow et al.,
eds., 1988, Antibodies, A Laboratory Manual, Cold Spring
Harbor Laboratories, Cold Spring Harbor, New York). Purified
human tissue factor against which monoclonal antibodies can
be raised is similarly well-known (Morrisey et al., 1987,
Cell 50:129) and available to the skilled artisan. Murine
monoclonal antibodies, and in particular murine monoclonal
antibody TF8-5G9 disclosed by Ruf et al. and Morrisey et
al., 1988, Thrombosis Research 52:247, and U.S. Patent No.
5,223,427 are particularly preferred.

The ordinarily skilled artisan can determine the sequences
of the CDRs by reference to published scientific literature
or sequence databanks, or by cloning and sequencing the
heavy and light chains of the antibodies by conventional
methodology. In accordance with the present invention, the
cDNA and amino acid sequences of the heavy chain (SEQ ID
NOS: 1 and 2, respectively) and light chain (SEQ ID NOS: 3
and 4, respectively) of murine monoclonal antibody TF8-5G9
are provided. The cDNA and deduced amino acid sequence of
the murine TF8-5G9 heavy chain is provided at Figure 1. The
cDNA and deduced amino acid sequence of the murine TF8-5G9
light chain is provided at Figure 2.

Each of the heavy and light chain variable regions contains
three CDRs that combine to form the antigen binding site.
The three CDRs are surrounded by four FR regions that
primarily function to support the CDRs. The sequences of the
CDRs within the sequences of the variable regions of the
heavy and light chains can be identified by
computer-assisted alignment according to Kabat et al. (1987)
in Sequences of Proteins of Immunological Interest, 4th ed.,
United States Department of Health and Human Services, US
Government Printing Office, Washington, D.C., or by
molecular modeling of the variable regions, for example
utilizing the ENCAD program as described by Levitt (1983) J.
Mol. Biol. 168:595.

In a preferred embodiment the CDRs are derived from murine
monoclonal antibody TF8-5G9. The preferred heavy chain CDRs
have the following sequences:

CDR1 DYYMH (SEQ ID NO: 5)
CDR2 LIDPENGNTIYDPKFQG (SEQ ID NO: 6)
CDR3 DNSYYFDY (SEQ ID NO: 7)

The preferred light chain CDRs have the following sequences:

CDR1 KASQDIRKYLN (SEQ ID NO: 8)
CDR2 YATSLAD (SEQ ID NO: 9)
CDR3 LQHGESPYT (SEQ ID NO: 10)

The sequences of the CDRs of the murine or other non-human
antibody, and in particular the sequences of the CDRs of
TF8-5G9, may be modified by insertions, substitutions and
deletions to the extent that the CDR-grafted antibody
maintains the ability to bind to and inhibit human tissue
factor. The ordinarily skilled artisan can ascertain the
maintenance of this activity by performing the functional
assays described hereinbelow. The CDRs can have, for
example, from about 50% to about 100% homology to the CDRs
of SEQ ID NOS:5-10. In a preferred embodiment the CDRs have
from about






WO 96/40921 



-12- 



PCT/US96/09287 



80% to about 100% homology to the CDRs of SEQ ID NOS:5-10.
In a more preferred embodiment the CDRs have from about 90%
to about 100% homology to the CDRs of SEQ ID NOS:5-10. In a
most preferred embodiment the CDRs have about 100% homology
to the CDRs of SEQ ID NOS:5-10.
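
By way of illustration only, the homology ranges recited above
can be checked computationally. The following minimal sketch
assumes that homology is scored as simple positional identity
between equal-length, ungapped sequences; that scoring rule,
the function name percent_identity and the variant sequence
are assumptions made for this example and are not taken from
the present disclosure.

```python
def percent_identity(candidate: str, reference: str) -> float:
    """Percentage of positions at which two equal-length amino acid
    sequences carry the same residue (ungapped comparison)."""
    if len(candidate) != len(reference):
        raise ValueError("sequences must have equal length for this simple measure")
    if not reference:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for a, b in zip(candidate, reference) if a == b)
    return 100.0 * matches / len(reference)


# Heavy chain CDR3 of TF8-5G9 (SEQ ID NO: 7) as listed above.
HEAVY_CDR3 = "DNSYYFDY"

# Hypothetical variant carrying a single substitution (illustration only).
variant = "DSSYYFDY"

print(f"{percent_identity(variant, HEAVY_CDR3):.1f}% identity to SEQ ID NO: 7")
```

A variant of unequal length (an insertion or deletion) would
require a gapped alignment, which this sketch deliberately
does not attempt.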

The FR and C regions of the CDR-grafted antibodies of the
present invention are derived from one or more human
antibodies. Human antibodies of the same class and type as
the antibody from which the CDRs are derived are preferred.
The FR of the variable region of the heavy chain is
preferably derived from the human antibody KOL (Schmidt et
al., 1983, Hoppe-Seyler's Z. Physiol. Chem. 364:713). The FR
of the variable region of the light chain is preferably
derived from the human antibody REI (Epp et al., 1974, Eur.
J. Biochem. 45:513). In accordance with the present
invention, it has been discovered that certain residues of
the human FR are preferably replaced by the corresponding
residue of the non-human antibody from which the CDRs are
derived. For example, certain FR residues of TF8-5G9 are
preferably retained to achieve optimal binding to antigen.

For convenience, the numbering scheme of Kabat et al. has
been adopted herein. Residues are designated by lower case
letters or hyphens as necessary to conform the present
sequences to the standard Kabat numbered sequence.

In accordance with the present invention, residues that are
retained in the FR region, i.e., residues that are not
replaced by human FR residues, are determined according to
the following guidelines. Residues that are idiosyncratic to
the parent antibody, e.g. TF8-5G9, relative to a human
consensus sequence of Kabat et al., are retained. Residues
of the parent antibody that are in agreement with the
consensus sequence are retained if the corresponding residue
of the human antibody, e.g. KOL or REI, is idiosyncratic.
Residues that are part of the antibody loop canonical
structures defined by Chothia et al. (1989) Nature 342:877,
such as residue 71 of the heavy and light chains, are
retained. FR residues predicted to form loops, such as
residues 28-30 of the heavy chain, are retained. FR residues
predicted to influence the conformation of the CDRs, such as
residues 48 and 49 preceding CDR2 of the heavy chain, are
retained. Residues that have been demonstrated to be
critical in the humanization of other antibodies may also be
retained. The foregoing guidelines are followed to the
extent necessary to support the antigen binding site formed
by the CDRs while simultaneously maximizing the humanization
of the antibody.
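
The guidelines above may be expressed, for purposes of
illustration, as a simple decision rule applied to each FR
position. The sketch below is not the procedure actually used
to design the antibodies described herein. It assumes that
"idiosyncratic" means "differs from the human consensus
residue", and the class name, field names and example
residues are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class FrameworkPosition:
    parent: str                 # residue in the parent (e.g. murine) antibody
    human: str                  # residue in the human acceptor FR (e.g. KOL or REI)
    consensus: str              # residue in the human consensus sequence
    canonical: bool = False     # part of a canonical loop structure
    loop: bool = False          # predicted to form a loop
    affects_cdr: bool = False   # predicted to influence CDR conformation


def retain_parent_residue(pos: FrameworkPosition) -> bool:
    """Return True if the parent residue would be kept under the guidelines."""
    if pos.parent != pos.consensus:
        return True   # parent residue is idiosyncratic
    if pos.human != pos.consensus:
        return True   # parent agrees with consensus, human acceptor does not
    # Structural criteria apply even when all three residues agree.
    return pos.canonical or pos.loop or pos.affects_cdr


# Hypothetical positions, for illustration only.
examples = {
    "idiosyncratic parent": FrameworkPosition("K", "R", "R"),
    "idiosyncratic human": FrameworkPosition("A", "V", "A"),
    "canonical position": FrameworkPosition("A", "A", "A", canonical=True),
    "fully conserved": FrameworkPosition("S", "S", "S"),
}
for label, pos in examples.items():
    print(label, "->", "retain parent" if retain_parent_residue(pos) else "use human")
```

Because the guidelines are followed only to the extent
necessary to support the antigen binding site, such a rule
would at most produce candidate positions for experimental
evaluation.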

The amino acid sequence of a representative CDR-grafted
heavy chain variable region derived from murine monoclonal
antibody TF8-5G9 and human antibody KOL is shown below. The
CDR-grafted heavy chain is designated TF8HCDR1; murine
residues were retained in the FR at residues 6, 17, 23, 24,
28, 29, 30, 48, 49, 68, 71, 73, 78, 88 and 91. CDRs are
underlined.

10 20 30 35ab 50 

QVQLVQSGGG VVQPGRLLRL SCKASGFNIK DYYMH — WVR QAPGKGLEWIG
52abc 60 70 80 82abc 90 

LIDP — ENGNTIYD PKFQGR FSIS ADTSK — RTAFL QMDSLRPEDTAVY 
100 110 
YCAR DNSYYF DYWGQGTPVT VSS (SEQ ID NO: 11)

The amino acid sequence of a representative CDR-grafted
light chain variable region derived from murine monoclonal
antibody TF8-5G9 and human antibody REI is shown below. The
CDR-grafted light chain is designated TF8LCDR1; murine
residues were retained in the FR at residues 39, 41, 46 and
105. CDRs are underlined.

10 20 30 40 50 

DIQMTQSPSS LSASVGDRVT IT CKASQDIR KYLNWYQQK WKAPKTLIYY 

60 70 80 90 100 

ATSLADG VPS RFSGSGSGTD YTFTISSLQP EDIATYYC LQ HGESPYT FGQ 

GTKLEITR (SEQ ID NO: 12) 

A CDR-grafted antibody containing variable regions TF8HCDR1
and TF8LCDR1 has been demonstrated in accordance with the
present invention to be as effective as murine monoclonal
antibody TF8-5G9 in binding to human tissue factor. It has
been further discovered in accordance with the present
invention, by examination of the molecular structure of
murine monoclonal antibody TF8-5G9, and by design,
construction, and analysis of CDR-grafted antibodies, that
the FR regions can be further humanized without the loss of
antigen binding activity. In particular, the FR region may
retain the human FR residue at residues 6, 17, 68, 73 and 78
of the heavy chain, and residues 39, 41, 46 and 105 of the
light chain, with maintenance of antigen binding activity.
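
The relationship between the two heavy chain designs can be
verified by simple set arithmetic. The sketch below, offered
only as a consistency check, starts from the FR positions at
which TF8HCDR1 retains murine residues, removes the heavy
chain positions stated above to tolerate the human residue,
and lists the positions that remain murine. The variable
names are illustrative.

```python
# Kabat FR positions at which TF8HCDR1 retains murine residues.
HCDR1_MURINE_FR = {6, 17, 23, 24, 28, 29, 30, 48, 49, 68, 71, 73, 78, 88, 91}

# Heavy chain positions stated to tolerate the human FR residue.
HEAVY_HUMAN_TOLERATED = {6, 17, 68, 73, 78}

# Positions that remain murine after the further humanization.
remaining_murine = sorted(HCDR1_MURINE_FR - HEAVY_HUMAN_TOLERATED)
print(remaining_murine)
# prints [23, 24, 28, 29, 30, 48, 49, 71, 88, 91]
```

The resulting ten positions are those recited for the most
preferred heavy chain variable region TF8HCDR20.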

In a most preferred embodiment, the heavy chain variable
region contains a FR derived from human antibody KOL in
which murine monoclonal antibody TF8-5G9 residues are
retained at amino acids 23, 24, 28, 29, 30, 48, 49, 71, 88
and 91. The preferred heavy chain variable region is
designated TF8HCDR20 and has the following sequence.

10 20 30 35ab 50

QVQLVESGGG VVQPGRSLRL SCKASGFNIK DYYMH — WVR QAPGKGLEWIGL

52abc 60 70 80 82abc 90 100

IDP — ENGNTIYD PKFQGR FTIS ADNSKNTLFL QMDSLRPEDTAVY YCAR DNSYYF

110

DYWGQGTPVT VSS (SEQ ID NO: 13)

In a most preferred embodiment, the light chain variable
region contains a FR derived from human antibody REI in
which murine monoclonal antibody TF8-5G9 residues are
retained at amino acids 39 and 105. The preferred light
chain variable region is designated TF8LCDR20 and has the
following sequence.

10 20 30 40 50 

DIQMTQSPSS LSASVGDRVT ITCKASQDIR KYLNWYQQKP GKAPKLLIYY 
60 70 80 90 100

ATSLADGVPS RFSGSGSGTD YTFTISSLQP EDIATYY CLQ HGESPYTF GQ 
GTKLEITR (SEQ ID NO: 14) 

It is within the ken of the ordinarily skilled artisan to
make minor modifications of the foregoing sequences,
including amino acid substitutions, deletions and
insertions. Any such modifications are within the scope of
the present invention so long as the resulting CDR-grafted
antibody maintains the ability to bind to and inhibit human
tissue factor. The ordinarily skilled artisan can assess the
activity of the CDR-grafted antibody with reference to the
functional assays described hereinbelow.

The human constant region of the CDR-grafted antibodies of
the present invention is selected to minimize effector
function. The intended use of the CDR-grafted antibodies of
the present invention is to block the coagulation cascade by
inhibition of tissue factor, and thus antibody effector
functions such as fixation of complement are not desirable.
Antibodies with minimal effector functions include IgG2,
IgG4, IgA, IgD and IgE. In a preferred embodiment of the
present invention, the heavy chain constant region is the
human IgG4 constant region, and the light chain constant
region is the human IgG4 kappa constant region.

In that effector functions may not be desirable for
therapeutic uses, the present invention further contemplates
active fragments of the CDR-grafted antibodies, and in
particular Fab fragments and F(ab')2 fragments. Active
fragments are those fragments capable of inhibiting human
tissue factor. Fab fragments and F(ab')2 fragments may be
obtained by conventional means, for example by cleavage of
the CDR-grafted antibodies of the invention with an
appropriate proteolytic enzyme such as papain or pepsin, or
by recombinant production. The active fragments maintain the
antigen binding sites of the CDR-grafted antibodies and thus
are similarly useful therapeutically.

The ability of the CDR-grafted antibodies designed and
constructed as taught in accordance with the present
invention to bind and inhibit human tissue factor can be
assessed by functional assays. For example, in a rapid and
convenient assay, expression vectors containing nucleic
acids encoding the CDR-grafted heavy and light chains can be
co-transfected into suitable host cells and transiently
expressed. The resulting antibodies can be assessed by
standard assays for ability to bind human tissue factor, and
for ability to compete for binding to tissue factor with the
non-human antibody from which the CDRs are derived.

For example, transient expression of nucleic acids encoding
the CDR-grafted heavy and light chains in COS cells provides
a rapid and convenient system to test antibody gene
expression and function. Nucleic acids encoding the
CDR-grafted heavy and light chains, respectively, are cloned
into a mammalian cell expression vector, for example pSG5,
described by Green et al. (1988) Nucleic Acids Res. 16:369
and commercially available from Stratagene Cloning Systems,
La Jolla, CA. The pSG5 expression vector provides unique
restriction sites for the insertion of the heavy and light
chain genes, and in vivo expression is under the control of
the SV40 early promoter. Transcriptional termination is
signaled by the SV40 polyadenylation signal sequence.

The pSG5-based expression vectors containing nucleic acids
encoding the heavy and light chains are cotransfected into
COS cells and cultured under conditions suitable for
transient expression. Cell culture media is then harvested
and examined for antibody expression, for example by an
enzyme linked immunosorbent assay (ELISA), to determine that
suitable levels of antibody have been produced. An ELISA may
then be used to assess the ability of the CDR-grafted
antibody to bind to human tissue factor. Human tissue factor
is immobilized on a microtiter plate and the COS cell
supernatant containing the CDR-grafted antibody is added
followed by an incubation at room temperature for about one
hour. The plates are then washed with a suitable
detergent-containing buffer such as phosphate buffered
saline (PBS)/Tween, followed by the addition of the
components of a suitable detection system. For example,
horseradish peroxidase conjugated goat anti-human kappa
chain polyclonal antibody is added, followed by washing,
followed by addition of substrate for horseradish
peroxidase, and detection. The CDR-grafted antibodies within
the scope of the present invention are those which are
capable of binding to human tissue factor to a degree
comparable to the non-human antibody from which the CDRs are
derived as determined by the foregoing assay.

The ability of the CDR-grafted antibodies to inhibit the
activity of human tissue factor in vivo can be conveniently
assessed by the following in vitro assay that mimics in vivo
coagulation events. In response to vascular injury in vivo,
tissue factor binds to factor VII and facilitates the
conversion of factor VII to a serine protease (factor VIIa).
The factor VIIa-tissue factor complex converts factor X to a
serine protease (factor Xa). Factor Xa forms a complex with
factor Va (from the intrinsic coagulation pathway),
resulting in the conversion of prothrombin to thrombin,
which in turn results in the conversion of fibrinogen to
fibrin. In a convenient in vitro functional assay, tissue
factor is incubated in the presence of factor VIIa and the
CDR-grafted anti-tissue factor antibody produced in the
transient expression system described above. Factor X is
added and the reaction mixture is incubated, followed by an
assay for factor Xa activity utilizing a chromogenic
substrate for factor Xa (Spectrozyme FXa, American
Diagnostica, Inc., Greenwich, CT). The ability of the
CDR-grafted antibody to inhibit factor X activation thus
provides a measure of the ability of the CDR-grafted
antibody to inhibit the activity of human tissue factor.

The CDR-grafted antibodies within the scope of the present
invention are those which are capable of inhibiting human
tissue factor to a degree comparable to the non-human
antibody from which the CDRs are derived as determined by
the foregoing assay. In one embodiment, the CDR-grafted
antibody has at least 50% of the inhibitory activity of
TF8-5G9 for human tissue factor. In a preferred embodiment,
the CDR-grafted antibody has at least 70% of the inhibitory
activity of TF8-5G9 for human tissue factor. In a more
preferred embodiment, the CDR-grafted antibody has at least
80% of the inhibitory activity of TF8-5G9 for human tissue
factor. In a most preferred embodiment, the CDR-grafted
antibody has at least 90% of the inhibitory activity of
TF8-5G9 for human tissue factor.
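
For illustration, the comparison of a CDR-grafted antibody
with TF8-5G9 might be computed from factor Xa generation
rates as sketched below. The present disclosure does not
specify how inhibitory activity is quantified; the sketch
assumes that it is taken as the percent reduction in the
chromogenic signal relative to an antibody-free control, and
all function names and numerical readings are hypothetical.

```python
def percent_inhibition(rate_with_antibody: float, rate_without_antibody: float) -> float:
    """Percent reduction in factor Xa generation caused by the antibody."""
    if rate_without_antibody <= 0:
        raise ValueError("control rate must be positive")
    return 100.0 * (1.0 - rate_with_antibody / rate_without_antibody)


def relative_activity(grafted_inhibition: float, parent_inhibition: float) -> float:
    """Inhibitory activity of the grafted antibody as a percentage of the parent's."""
    if parent_inhibition <= 0:
        raise ValueError("parent antibody must show measurable inhibition")
    return 100.0 * grafted_inhibition / parent_inhibition


# Hypothetical chromogenic readings (arbitrary rate units), illustration only.
control_rate = 1.00    # no antibody
parent_rate = 0.20     # murine TF8-5G9
grafted_rate = 0.28    # CDR-grafted antibody

parent = percent_inhibition(parent_rate, control_rate)
grafted = percent_inhibition(grafted_rate, control_rate)
activity = relative_activity(grafted, parent)

print(f"relative inhibitory activity: {activity:.0f}% of TF8-5G9")
print("meets the 50% embodiment:", activity >= 50.0)
```

In practice the comparison would be made at matched antibody
concentrations, or from full dose-response curves, rather
than from single readings.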

In another embodiment, the present invention provides a
method of producing a CDR-grafted antibody capable of
inhibiting human tissue factor. The method comprises
constructing an expression vector containing a nucleic acid
encoding the CDR-grafted antibody heavy chain and an
expression vector containing a nucleic acid encoding the
CDR-grafted antibody light chain, transfecting suitable host
cells with the expression vectors, culturing the transfected
host cells under conditions suitable for the expression of
the heavy and light chains, and recovering the CDR-grafted
antibody. Alternately, one expression vector containing
nucleic acids encoding the heavy and light chains may be
utilized.

Standard molecular biological techniques, for example as
disclosed by Sambrook et al. (1989), Molecular Cloning: A
Laboratory Manual, Cold Spring Harbor Press, Cold Spring
Harbor, NY, may be used to obtain nucleic acids encoding the
heavy and light chains of the CDR-grafted antibodies of the
present invention. A nucleic acid encoding the CDR-grafted
variable domain may be constructed by isolating cDNA
encoding the antibody to be humanized, e.g. murine
monoclonal antibody TF8-5G9, by conventional cloning
methodology from the hybridoma producing the antibody, or by
polymerase chain reaction (PCR) amplification of the
variable region genes, as described for example by Winter et
al., followed by site-directed mutagenesis to substitute
nucleotides encoding the desired human residues into the FR
regions. Alternately, the cDNA encoding the human antibody
can be isolated, followed by site-directed mutagenesis to
substitute nucleotides encoding the desired murine residues
into the CDRs.

Nucleic acids encoding the CDR-grafted
variable domain may also be synthesized by assembling
synthetic oligonucleotides, for example utilizing DNA
polymerase and DNA ligase. The resulting synthetic
variable regions may then be amplified by PCR. Nucleic
acids encoding CDR-grafted variable domains may also be
constructed by PCR strand overlap methods that are known
in the art and reviewed by Owens et al.

Accordingly, having determined the desired
amino acid sequences of the CDR-grafted variable domains
in accordance with the present invention, the ordinarily
skilled artisan can obtain nucleic acids encoding the
variable domains. Further, the skilled artisan is aware
that due to the degeneracy of the genetic code, various
nucleic acid sequences can be constructed that encode
the CDR-grafted variable domains. All such nucleic acid
sequences are contemplated by the present invention.
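
The degeneracy argument can be made concrete with a short
calculation. The following Python sketch counts how many
distinct nucleic acid sequences encode a given amino acid
sequence under the standard genetic code. The function name
and the four-residue example peptide are illustrative
assumptions and are not taken from the sequences of the
invention.

```python
from collections import Counter
from itertools import product
from math import prod

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Number of synonymous codons per amino acid ("*" marks a stop signal).
DEGENERACY = Counter(CODON_TABLE.values())


def count_encodings(peptide: str) -> int:
    """Return how many DNA sequences encode `peptide` (one-letter codes)."""
    return prod(DEGENERACY[aa] for aa in peptide.upper())


if __name__ == "__main__":
    # Hypothetical peptide, chosen only for illustration.
    print(count_encodings("MRLS"))
```

Because the count is a product over residues, it grows very
quickly with length, which is why a variable domain of more
than one hundred residues can be encoded by an enormous
number of different nucleic acid sequences.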

The nucleic acids encoding the CDR-grafted
variable domains are linked to appropriate nucleic acids
encoding the human antibody heavy or light chain
constant region. Nucleic acid sequences encoding human
heavy and light chain constant regions are known in the
art. It is within the ken of the ordinarily skilled
artisan to include sequences that facilitate
transcription, translation and secretion, for example
start codons, leader sequences, the Kozak consensus
sequence (Kozak, 1987, J. Mol. Biol. 196:947) and the
like, as well as restriction endonuclease sites to
facilitate cloning into expression vectors.

The present invention thus further provides
nucleic acids encoding the heavy and light chains of
CDR-grafted antibodies capable of inhibiting human
tissue factor wherein the CDRs are derived from a murine
monoclonal antibody against tissue factor and the FR and
C regions are derived from one or more human antibodies.

In accordance with the present invention,
representative nucleic acids encoding CDR-grafted heavy
and light chains were constructed. The CDR-grafted
heavy chain comprises a variable region containing FR
regions derived from human antibody KOL and CDRs derived
from murine monoclonal antibody TF8-5G9 and further
comprises a constant region derived from the heavy chain
of human IgG4. The CDR-grafted light chain comprises a
variable region containing FR regions derived from human
antibody REI and CDRs derived from murine monoclonal
antibody TF8-5G9 and further comprises a constant region
derived from human IgG4 kappa chain. Nucleic acids
encoding the heavy and light chains were constructed by
assembling the variable regions from synthetic
nucleotides, amplifying the assembled variable regions
by PCR, purifying the amplified nucleic acids, and
ligating the nucleic acid encoding the variable region
into a vector containing a nucleic acid encoding the
appropriate human constant region.

The sequences of representative nucleic acids
encoding CDR-grafted heavy and light chains are
presented as nucleotides 1-2360 of SEQ ID NO:15 and
nucleotides 1-759 of SEQ ID NO:20, respectively.

The nucleic acid sequence encoding a preferred
heavy chain (nucleotides 1-2360 of SEQ ID NO:15) is
designated the TF8HCDR20 gene. The nucleic acid
sequence contains the following regions: 5' EcoRI
restriction site (nucleotides 1-6); Kozak sequence
(nucleotides 7-15); start codon and leader sequence
(nucleotides 16-72); CDR-grafted variable region
(nucleotides 73-423); human IgG4 CH1 domain (nucleotides
424-717); human IgG4 intron 2 (nucleotides 718-1110);
human IgG4 hinge (nucleotides 1111-1146); human IgG4
intron 3 (nucleotides 1147-1267); human IgG4 CH2 domain
(nucleotides 1268-1594); human IgG4 intron 4
(nucleotides 1595-1691); human IgG4 CH3 domain
(nucleotides 1692-2012); 3' untranslated region
(nucleotides 2013-2354); 3' BamHI end spliced to BclI
site of expression vector (nucleotides 2355-2360).

The nucleic acid sequence encoding a preferred
light chain gene (nucleotides 1-759 of SEQ ID NO:20) is
designated the TF8LCDR3 gene. The nucleic acid sequence
contains the following regions: 5' EcoRI restriction
site (nucleotides 1-5); Kozak sequence (nucleotides
6-8); start codon and leader sequence (nucleotides 9-68);
CDR-grafted variable region (nucleotides 69-392); human
kappa constant region (nucleotides 393-710); 3'
untranslated region (nucleotides 711-753); 3' BamHI end
spliced to BclI site of expression vector (nucleotides
754-759).
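
The two region lists above can be checked mechanically. The
following Python sketch records the stated nucleotide
boundaries, verifies that the regions tile each gene without
gaps or overlaps, and derives region lengths. The variable
and function names are illustrative assumptions; only the
boundary numbers come from the text.

```python
# Region boundaries (1-based, inclusive) as listed in the text.
TF8HCDR20 = [
    ("5' EcoRI site", 1, 6),
    ("Kozak sequence", 7, 15),
    ("start codon and leader", 16, 72),
    ("CDR-grafted variable region", 73, 423),
    ("IgG4 CH1", 424, 717),
    ("IgG4 intron 2", 718, 1110),
    ("IgG4 hinge", 1111, 1146),
    ("IgG4 intron 3", 1147, 1267),
    ("IgG4 CH2", 1268, 1594),
    ("IgG4 intron 4", 1595, 1691),
    ("IgG4 CH3", 1692, 2012),
    ("3' untranslated region", 2013, 2354),
    ("3' BamHI/BclI junction", 2355, 2360),
]

TF8LCDR3 = [
    ("5' EcoRI site", 1, 5),
    ("Kozak sequence", 6, 8),
    ("start codon and leader", 9, 68),
    ("CDR-grafted variable region", 69, 392),
    ("kappa constant region", 393, 710),
    ("3' untranslated region", 711, 753),
    ("3' BamHI/BclI junction", 754, 759),
]


def check_contiguous(regions, total_length):
    """Return True if the regions tile 1..total_length with no gaps or overlaps."""
    expected_start = 1
    for _name, start, end in regions:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == total_length + 1


def region_lengths(regions):
    """Map each region name to its length in nucleotides."""
    return {name: end - start + 1 for name, start, end in regions}


if __name__ == "__main__":
    print(check_contiguous(TF8HCDR20, 2360), check_contiguous(TF8LCDR3, 759))
    print(region_lengths(TF8HCDR20)["CDR-grafted variable region"])
    print(region_lengths(TF8LCDR3)["CDR-grafted variable region"])
```

On these boundaries the heavy chain variable region spans
351 nucleotides (117 codons) and the light chain variable
region spans 324 nucleotides (108 codons).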

The foregoing preferred sequences can be
modified by the ordinarily skilled artisan to take into
account degeneracy of the genetic code, and to make
additions, deletions, and conservative and
nonconservative substitutions that result in a
maintenance of the function of the nucleic acid, i.e.
that it encodes a heavy or light chain of a CDR-grafted
antibody capable of inhibiting human tissue factor.
Restriction sites and sequences that facilitate
transcription and translation may be altered or
substituted as necessary depending upon the vector and
host system chosen for expression.

Suitable expression vectors and hosts for
production of the CDR-grafted antibodies of the present
invention are known to the ordinarily skilled artisan.
The expression vectors contain regulatory sequences,
such as replicons and promoters, capable of directing
replication and expression of heterologous nucleic acid
sequences in a particular host cell. The vectors may
also contain selection genes, enhancers, signal
sequences, ribosome binding sites, RNA splice sites,
polyadenylation sites, transcriptional terminator
sequences, and so on. The vectors may be constructed by
conventional methods well-known in the art, or obtained
from commercial sources. The expression vectors
preferably have convenient restriction sites at which
the nucleic acids encoding the antibody chains of the
invention are inserted. Myeloma expression vectors in
which antibody gene expression is driven by the human
cytomegalovirus promoter-enhancer are particularly
preferred.

Expression vectors containing a nucleic acid
encoding the CDR-grafted heavy chain under the control
of a suitable promoter and expression vectors containing
a nucleic acid encoding the CDR-grafted light chain
under the control of a suitable promoter are
cotransfected into a suitable host cell. In another
embodiment, nucleic acids encoding both heavy and light
chains are provided in a single vector for transfection
of a suitable host cell.

Suitable host cells or cell lines for
expression of the CDR-grafted antibodies of the present
invention include bacterial cells, yeast cells, insect
cells, and mammalian cells such as Chinese hamster ovary
(CHO) cells, COS cells, fibroblast cells and myeloid
cells. Mammalian cells are preferred. CHO, COS and
myeloma cells are particularly preferred. Myeloma cells
are preferred for establishing permanent CDR-grafted
antibody producing cell lines. Expression of antibodies
in myeloma cells, bacteria, and yeast is reviewed by
Sandhu (1992) Critical Reviews in Biotechnology 12:437.
Expression in mammalian cells is reviewed by Owens et al.

Transfection of host cells by the expression
vectors containing nucleic acids encoding the
CDR-grafted heavy and light chains can be accomplished by
methods well-known to one of ordinary skill in the art.
Such methods include, for example, calcium chloride
transfection, calcium phosphate transfection,
lipofection and electroporation. Suitable culture
methods and conditions for the production of the
CDR-grafted antibodies are likewise well-known in the art.
The CDR-grafted antibodies can be purified by
conventional methods, including ammonium sulfate
precipitation, affinity chromatography, gel
electrophoresis, and the like. The ability of the
CDR-grafted antibodies to bind to and inhibit human tissue
factor can be assessed by the in vitro assays described
above.

The CDR-grafted antibodies of the present
invention have a variety of utilities. For example, the
antibodies are capable of binding to human tissue factor
and thus are useful in assays for human tissue factor
from body fluid samples, purification of human tissue
factor, and so on.

The CDR-grafted antibodies of the present
invention are capable of inhibiting human tissue factor.
Human tissue factor is well-known to be an essential
element in the human coagulation cascade. The ability
of the antibodies of the present invention to disrupt
the coagulation cascade is demonstrated by in vitro
assays in which the antibodies prevent factor X
activation. Accordingly, the present antibodies are
useful in the attenuation of coagulation. The present
invention thus provides a method of attenuation of
coagulation comprising administering a therapeutically
effective amount of CDR-grafted antibody capable of
inhibiting human tissue factor to a patient in need of
such attenuation.

Numerous thrombotic disorders are
characterized by excessive or inappropriate coagulation
and are effectively treated or prevented by
administration of agents that interfere with the
coagulation cascade. Accordingly, the present invention
further provides a method of treatment or prevention of
a thrombotic disorder comprising administering a
therapeutically effective amount of a CDR-grafted
antibody capable of inhibiting human tissue factor to a
patient in need of such treatment or prevention. In a
preferred embodiment, the thrombotic disorder is
intravascular coagulation, arterial restenosis or
arteriosclerosis. The antibodies of the invention may be
used in combination with other antibodies or therapeutic
agents.

A therapeutically effective amount of the
antibodies of the present invention can be determined by
the ordinarily skilled artisan with regard to the
patient's condition, the condition being treated, the
method of administration, and so on. A therapeutically
effective amount is the dosage necessary to alleviate,
eliminate, or prevent the thrombotic disorder as
assessed by conventional parameters. For example, a
therapeutically effective dose of a CDR-grafted antibody
of the present invention may be from about 0.1 mg to
about 20 mg per 70 kg of body weight. A preferred
dosage is about 1.0 mg to about 5 mg per 70 kg of body
weight.
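
The "per 70 kg" convention is a unit normalization. The
following Python sketch shows the arithmetic only, assuming
that the stated ranges scale linearly with body weight; that
linearity, the function names, and the example weight are
assumptions made for illustration and are not stated in the
text.

```python
REFERENCE_WEIGHT_KG = 70.0


def scale_dose(dose_mg_per_70kg: float, body_weight_kg: float) -> float:
    """Scale a dose quoted per 70 kg of body weight to another body weight.

    Assumes simple linear scaling, which is an illustrative assumption.
    """
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    return dose_mg_per_70kg * body_weight_kg / REFERENCE_WEIGHT_KG


def dose_range(body_weight_kg: float, low_mg: float = 0.1, high_mg: float = 20.0):
    """Return the (low, high) dose in mg for the stated per-70-kg range."""
    return scale_dose(low_mg, body_weight_kg), scale_dose(high_mg, body_weight_kg)


if __name__ == "__main__":
    # Hypothetical 35 kg body weight: half of each per-70-kg figure.
    print(dose_range(35.0))
    print(dose_range(35.0, low_mg=1.0, high_mg=5.0))
```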

A patient in need of such treatment is a
patient suffering from a disorder characterized by
inappropriate or excessive coagulation, or a patient at
risk of such a disorder. For example, anticoagulant
therapy is useful to prevent postoperative venous
thrombosis, and arterial restenosis following balloon
angioplasty.

The CDR-grafted antibodies of the present
invention are useful in the same manner as comparable
therapeutic agents, and the dosage level is of the same
order of magnitude as is generally employed with those
comparable therapeutic agents. The present antibodies
may be administered in combination with a
pharmaceutically acceptable carrier by methods known to
one of ordinary skill in the art.

Another embodiment of the present invention is
directed to a pharmaceutical composition comprising at
least one CDR-grafted antibody capable of inhibiting
human tissue factor and further comprising a
pharmaceutically acceptable carrier. As used herein,
"pharmaceutically acceptable carrier" includes any and
all solvents, dispersion media, coatings, antibacterial
and antifungal agents, isotonic and absorption delaying
agents, and the like. The use of such media and agents
for pharmaceutically active substances is well-known in
the art. Except insofar as any conventional media or
agent is incompatible with the active ingredient, its
use in the therapeutic compositions is contemplated.
Supplementary active ingredients can also be
incorporated into the compositions.

The antibodies can be administered by well-known
routes including oral and parenteral, e.g.,
intravenous, intramuscular, intranasal, intradermal,
subcutaneous, and the like. Parenteral administration
and particularly intravenous administration is
preferred. Depending on the route of administration,
the pharmaceutical composition may require protective
coatings.

The pharmaceutical forms suitable for
injectable use include sterile aqueous solutions or
dispersions and sterile powders for the extemporaneous
preparation of sterile injectable solutions or
dispersions. In all cases the ultimate solution form
must be sterile and fluid. Typical carriers include a
solvent or dispersion medium containing, for example,
water, buffered aqueous solutions (i.e., biocompatible
buffers), ethanol, polyol such as glycerol, propylene
glycol, polyethylene glycol, suitable mixtures thereof,
surfactants or vegetable oils. The antibodies may be
incorporated into liposomes for parenteral
administration. Sterilization can be accomplished by any
art-recognized technique, including but not limited to,
addition of antibacterial or antifungal agents, for
example, paraben, chlorobutanol, phenol, sorbic acid or
thimerosal. Further, isotonic agents such as sugars or
sodium chloride may be incorporated in the subject
compositions.

Production of sterile injectable solutions
containing the subject antibodies is accomplished by
incorporating these antibodies in the required amount in
the appropriate solvent with various ingredients
enumerated above, as required, followed by
sterilization, preferably filter sterilization. To
obtain a sterile powder, the above solutions are
vacuum-dried or freeze-dried as necessary.

WO 96/40921 PCT/US96/09287

The following examples further illustrate the
present invention.

EXAMPLE 1

Isolation and Sequencing of TF8-5G9
Light Chain (LC) and Heavy Chain (HC)

Two DNA libraries were generated from
oligo(dT)-primed TF8-5G9 hybridoma RNA utilizing standard
molecular biology procedures as described by Sambrook et
al. The cDNA was cloned into the Librarian II plasmid
vector from Invitrogen (San Diego, CA), and the
libraries were screened for cDNA clones encoding murine
IgG HC and LC. A full-length cDNA clone for the heavy
chain could not be isolated, despite the construction of
two independent libraries. A random primed TF8-5G9 cDNA
library was generated to obtain the missing 5' sequence
of the heavy chain. Consequently, the heavy chain cDNA
was in two pieces: a 5' clone of 390 nucleotides and a
3' clone of 1392 nucleotides. The two HC clones overlap
by 292 nucleotides.
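
The clone lengths and overlap stated above imply the length
of the combined heavy chain cDNA. The following Python
sketch shows that arithmetic and a minimal way to join two
overlapping sequences. The function names and the short toy
sequences are illustrative assumptions; the actual clone
sequences are not reproduced here.

```python
def combined_length(len_5p: int, len_3p: int, overlap: int) -> int:
    """Length of the sequence covered by two clones sharing `overlap` bases."""
    return len_5p + len_3p - overlap


def merge_clones(five_prime: str, three_prime: str, overlap: int) -> str:
    """Join two sequences whose adjoining ends share `overlap` identical bases."""
    if overlap <= 0 or overlap > min(len(five_prime), len(three_prime)):
        raise ValueError("overlap must be positive and no longer than either clone")
    if five_prime[-overlap:] != three_prime[:overlap]:
        raise ValueError("clones do not agree over the stated overlap")
    return five_prime + three_prime[overlap:]


if __name__ == "__main__":
    # Figures from the text: 390 nt and 1392 nt clones overlapping by 292 nt.
    print(combined_length(390, 1392, 292))
    # Toy sequences (hypothetical) sharing a 5-base overlap.
    print(merge_clones("ATGGCCAAGC", "CAAGCTTGGA", 5))
```

With the stated figures, the two clones together cover
390 + 1392 - 292 = 1490 nucleotides.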

The HC and LC clones were completely sequenced
by the dideoxy chain termination method of Sanger et al.
(1977) Proc. Natl. Acad. Sci. USA 74:5463. To verify
the variable region sequence, sequence was obtained from
PCR-amplified cDNA that had been synthesized from total
TF8-5G9 hybridoma RNA. Total TF8-5G9 hybridoma RNA was
isolated by the guanidinium thiocyanate method of
Chirgwin et al. (1979) Biochemistry 18:5294. cDNA was
synthesized using the Perkin Elmer (Norwalk, CT) GeneAmp
RNA Polymerase Chain Reaction (PCR) kit with an
oligo(dT) primer. Components of the same kit were used in
the PCR to amplify the LC and HC variable regions using
primers based on the sequence that had been obtained for
the cDNA clones. The amplified variable region
fragments were gel-purified and sequenced according to
the method of Tracy et al. (1991) BioTechniques 11:68 on
a Model 373A Applied Biosystems, Inc. (Foster City, CA)
automated fluorescent DNA sequencer. The sequence for
TF8-5G9 LC and HC obtained from RNA amplification and
the sequence obtained from the cDNA clones agreed. The
TF8-5G9 HC variable region sequence with protein
translation is shown in Figure 1 and SEQ ID NO:1, and
that for the LC is shown in Figure 2 and SEQ ID NO:3.

EXAMPLE 2

Chimeric LC and HC Expression Vector Construction

In order to test the binding activity of the
CDR-grafted anti-TF LC and HC individually, mouse-human
chimeric TF8-5G9 LC and HC were constructed. This
allowed the CDR-grafted LC to be tested for TF binding
ability in combination with the chimeric HC, and the
CDR-grafted HC to be tested in combination with the
chimeric LC.

Primers were designed to amplify the TF8-5G9
LC variable region using as template cDNA clones in the
Librarian II vector. The 5' primer was designed with an
EcoRI site while the 3' primer was designed with a NarI
site. PCR was used to amplify the LC variable region,
generating a 433 bp fragment with a 5' EcoRI end and a
3' NarI end. The fragment included the signal sequence
from the TF8-5G9 LC cDNA clone but incorporated a 2 base
change in the arginine codon immediately following the
ATG start codon. This change retained the arginine
residue but made the sequence conform to the Kozak
consensus sequence in order to potentially improve
translation of the LC mRNA. The PCR amplified LC
variable region fragment was digested with EcoRI and
NarI restriction enzymes and purified by electrophoresis
on a 2% Nusieve, 1% Seakem agarose gel (FMC Bio
Products, Rockland, ME).

The DNA was extracted from the gel slice and
purified by the Geneclean (Bio 101, La Jolla, CA)
procedure. The full-length chimeric TF8-5G9 LC gene was
generated by cloning this DNA into the EcoRI and NarI
sites of a pSP73 vector (Promega, Madison, WI) which
contains the human kappa constant region. The gene was
isolated from the pSP73 vector by EcoRI digestion and
subcloned into the EcoRI site of the pSG5 mammalian cell
expression vector (Stratagene Cloning Systems, La Jolla,
CA).

The chimeric TF8-5G9 HC gene was assembled in
a manner similar to that of the chimeric LC. Since
there was no full-length HC cDNA isolated from the
Librarian II vector cDNA libraries, the HC variable
region fragment that was generated by the PCR from total
TF8-5G9 hybridoma cell RNA was used as the template.
Primers which incorporated an EcoRI site at the 5' end
and a SacI site at the 3' end were used in the PCR to
generate a 430 bp fragment which contained the TF8-5G9
HC Kozak sequence, start codon, signal sequence, and
variable region. This fragment was digested with the
restriction enzymes EcoRI and SacI, and gel-purified
using the same procedure that was used with the chimeric
LC construction.

The full-length TF8-5G9 chimeric HC gene was
constructed by cloning the variable region fragment into
the EcoRI and SacI sites of the pSG5 expression vector
containing the human IgG4 constant region.
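
The directional cloning steps in this example depend on each
PCR fragment carrying one recognition site for each of the
two enzymes, in the right order. The following Python sketch
illustrates such a check. The recognition sequences are the
standard ones for these enzymes; the function names and the
29-base toy fragment are assumptions for illustration and do
not reproduce the actual 430 bp or 433 bp fragments.

```python
# Standard recognition sequences (5' to 3') for enzymes named in the text.
RECOGNITION_SITES = {
    "EcoRI": "GAATTC",
    "NarI": "GGCGCC",
    "SacI": "GAGCTC",
    "BamHI": "GGATCC",
    "BclI": "TGATCA",
}


def find_sites(sequence: str, enzyme: str) -> list:
    """Return the 1-based start position of every occurrence of the enzyme's site."""
    site = RECOGNITION_SITES[enzyme]
    seq = sequence.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start + 1)
        start = seq.find(site, start + 1)
    return positions


def is_clonable(fragment: str, five_prime_enzyme: str, three_prime_enzyme: str) -> bool:
    """True if each enzyme has exactly one site and the 5' site comes first."""
    five = find_sites(fragment, five_prime_enzyme)
    three = find_sites(fragment, three_prime_enzyme)
    return len(five) == 1 and len(three) == 1 and five[0] < three[0]


if __name__ == "__main__":
    # Hypothetical fragment: EcoRI site, short insert, SacI site.
    toy_fragment = "GAATTCGCCACCATGGCTTGGGTGAGCTC"
    print(find_sites(toy_fragment, "EcoRI"), find_sites(toy_fragment, "SacI"))
    print(is_clonable(toy_fragment, "EcoRI", "SacI"))
```

A check of this kind also flags internal sites: a second
occurrence of either recognition sequence inside the variable
region would cause the digest to cut the insert itself.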

EXAMPLE 3

Design and Construction of the
CDR-Grafted Heavy and Light Chain Genes

The variable region domains of the CDR-grafted
HC and LC genes were designed with an EcoRI overhang at
the 5' end followed by a Kozak sequence to improve
antibody expression. The leader sequences were derived
from the heavy and light chains of the murine monoclonal
antibody B72.3 (Whittle et al. (1987) Protein
Engineering 1:499). The 3' ends of the variable regions
were designed to have overhangs which allowed for
splicing to the appropriate human constant region DNA.

In the initially designed CDR-grafted TF8-5G9
heavy and light chains the CDRs were derived from murine
TF8-5G9 sequence while the frameworks were derived
primarily from human antibody sequence. The human
antibody KOL (Schmidt et al.) was used for the heavy
chain frameworks, while the human antibody REI (Epp et
al.) was used for the light chain frameworks.

Several criteria were used to select murine
framework residues in the design of the TF8-5G9
CDR-grafted heavy and light chain variable regions.
Framework residues which, at a particular position, are
idiosyncratic to TF8-5G9 were retained as murine
sequence with the assumption that they contributed to
its unique binding characteristics. TF8-5G9 murine
residues were also retained at framework positions where
they were in agreement with the human consensus sequence
but where the corresponding residues in KOL or REI were
idiosyncratic. Residues that are part of antibody loop
canonical structures such as residue 71 (numbering
according to Kabat et al.) of the heavy and light chains
were also retained as murine sequence. Framework
residues that form loops such as residues 26-30 of the
HC were kept as TF8-5G9 murine sequence at positions
where the murine sequence differed from the human.
Residues known to directly influence the conformation of
CDRs, such as 48 and 49 immediately preceding CDR2 of
the HC, were also retained as murine sequence.

The amino acid sequence of the variable region
for the initially designed CDR-grafted TF8-5G9 HC,
TF8HCDR1, is shown in SEQ ID NO:11. Murine residues
were retained at framework positions 6, 17, 23, 24, 28,
29, 30, 48, 49, 68, 71, 73, 78, 88 and 91. The
CDR-grafted HC variable region was attached to a human IgG4
constant region.

The amino acid sequence of the variable region
for the initially designed CDR-grafted TF8-5G9 LC,
TF8LCDR1, is shown in SEQ ID NO:12. Murine residues
were retained at framework positions 39, 41, 46 and 105.
The CDR-grafted LC variable region was attached to a
human kappa constant region.

The variable regions for the CDR-grafted HC and
LC described above were each assembled from 13 synthetic
oligonucleotides which were synthesized by Research
Genetics, Inc., Huntsville, AL. These oligonucleotides
ranged in length from 42 to 80 bases, and encoded both
variable region strands. When the 6 complementary
oligonucleotide pairs were annealed, the overhangs
generated were 17 to 24 bases in length. These
oligonucleotide pairs were combined, annealed at their
complementary overhangs, and ligated to give the final
full length double-stranded variable regions.
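
The principle behind this assembly is that the junctions
between oligonucleotides on one strand are offset from the
junctions on the other strand, so that each oligonucleotide
bridges two on the opposite strand. The following Python
sketch is a toy model of that staggered design. It is not
the actual oligonucleotide set: the function names, the
24-base sequence, and the chunk and stagger sizes are
assumptions chosen to keep the example small.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def staggered_oligos(top: str, chunk: int, stagger: int):
    """Split a duplex into top- and bottom-strand oligos with offset junctions.

    Top-strand junctions fall every `chunk` bases; bottom-strand junctions
    are shifted by `stagger` bases, so annealed pairs leave overhangs.
    """
    if not 0 < stagger < chunk:
        raise ValueError("stagger must be between 0 and chunk")
    top_oligos = [top[i:i + chunk] for i in range(0, len(top), chunk)]
    cuts = [0] + list(range(stagger, len(top), chunk)) + [len(top)]
    # Bottom-strand oligos are written 5' to 3', listed in top-strand order.
    bottom_oligos = [reverse_complement(top[a:b]) for a, b in zip(cuts, cuts[1:])]
    return top_oligos, bottom_oligos


def assemble(top_oligos, bottom_oligos) -> str:
    """Rebuild the top strand and confirm the bottom-strand oligos agree with it."""
    top = "".join(top_oligos)
    bottom_as_top = "".join(reverse_complement(o) for o in bottom_oligos)
    if top != bottom_as_top:
        raise ValueError("top and bottom strand oligos are not complementary")
    return top


if __name__ == "__main__":
    target = "ATGGCCAAGCTTGGATCCGAATTC"  # hypothetical 24-base duplex
    tops, bottoms = staggered_oligos(target, chunk=8, stagger=4)
    print(tops, bottoms)
    print(assemble(tops, bottoms) == target)
```

In this toy model the overhang length equals the stagger;
in the text the corresponding overhangs were 17 to 24 bases.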
The HC variable region oligonucleotides were
assembled into a 452 bp fragment which contains a 5'
EcoRI site and a 3' SacI site. The polymerase chain
reaction was used to amplify this fragment. The
resulting amplified DNA was purified on a 2% Nusieve, 1%
Seakem agarose gel (FMC). The appropriate size band of
DNA was excised and the DNA was recovered by the
Geneclean (Bio 101) procedure. The fragment was then
digested with EcoRI and SacI, and purified again by the
Geneclean method. This HC variable region fragment with
EcoRI and SacI ends was cloned into the EcoRI and SacI
sites of the pSport-1 vector (GIBCO-BRL Life
Technologies, Gaithersburg, MD). DNA from several
clones was isolated and sequenced to verify proper
variable region assembly. All clones had unexpected
base changes. One clone with the fewest base changes
(two mismatches at bases 133 and 140) was selected to be
corrected by site-directed mutagenesis according to
Kunkel (1985) Proc. Natl. Acad. Sci. USA 82:488.
Briefly, CJ236 (ung-, dut-) competent cells (Invitrogen
Corporation, San Diego, CA) were transformed with the
pSport vector containing the CDR-grafted HC variable
region with the two base mismatch. Single-stranded,
uridine-incorporated DNA templates were purified from
phage following M13 helper phage (Stratagene Cloning
Systems) infection of the transformed cells.
Mutagenesis oligos containing the desired base changes
were synthesized on an Applied Biosystems Model 380B DNA
synthesizer. The mutagenesis oligos were annealed to
the template DNA, and T7 DNA Polymerase and T4 DNA
Ligase (MutaGene InVitro Mutagenesis Kit, Bio-Rad
Laboratories, Richmond, CA) were used to incorporate the
oligo into a newly synthesized DNA strand. DH5α
competent cells (GIBCO-BRL Life Technologies) were
transformed with the double-stranded DNA. The original
uridine-incorporated strand is destroyed while the newly
synthesized strand containing the mutagenesis oligo is
replicated. Phagemid DNA was prepared from the
resulting mutagenesis clones and the variable regions
were sequenced to identify the clones which had
incorporated the desired changes. The corrected HC
EcoRI/SacI variable region fragment was excised from the
pSport vector, purified and ligated into the EcoRI/SacI
sites of a pSG5 vector containing the human IgG4
constant region. This resulted in the generation of a
full-length humanized TF8-5G9 HC gene, TF8HCDR1, in the
pSG5 COS cell expression vector. The vector was
designated pSG5TF8HCDR1.

The CDR-grafted TF8-5G9 LC variable region was
also amplified by the PCR from the assembled synthetic
oligonucleotides into a 433 bp fragment which contained
a 5' EcoRI site and a 3' NarI site. This fragment was
purified as described above for the HC, digested with
EcoRI and NarI and purified by the Geneclean procedure.
This fragment was cloned into the EcoRI and NarI sites
of a pSG5 vector which contains the human kappa constant
region. This resulted in the generation of a
full-length humanized TF8-5G9 LC gene, TF8LCDR1, in the pSG5
COS cell expression vector. Seven clones were
sequenced, and one was found to have the desired
CDR-grafted LC sequence. The vector was designated
pSG5TF8LCDR1.

EXAMPLE 4

Expression of the CDR-Grafted
Heavy and Light Chain Genes in COS Cells

The transient expression of antibody genes in
COS-1 cells provides a rapid and convenient system to
test antibody gene expression and function. COS-1 cells
were obtained from the American Type Culture Collection
(CRL 1650) and cultured in Dulbecco's Modified Eagle
Medium (DMEM, from GIBCO BRL Life Technologies) with 10%
fetal calf serum. The pSG5TF8HCDR1 expression vector
was cotransfected into COS cells with the pSG5 chimeric
LC expression vector using the DEAE-Dextran method
followed by DMSO shock as described by Lopata et al.
(1984) Nucleic Acids Res. 12:5707. After 4 days of
culture, media was harvested from the wells and examined
for antibody expression levels.

Antibody levels were determined by an ELISA-based
assembly assay. Plates were coated with a goat
anti-human Fc specific antibody. Various dilutions of
the COS cell supernatant containing secreted antibody
were added, incubated for one hour, and washed. A
horseradish peroxidase-linked goat anti-human kappa
chain antibody was added, incubated for one hour at room
temperature, and washed. Substrate for the horseradish
peroxidase was added for detection. Antibody levels in
the COS cell media were found to be nearly undetectable
for the TF8HCDR1 x chimeric LC. Upon closer examination
of the TF8HCDR1 variable region sequence, it was found
that an unexpected base change, which had occurred
during the site-directed mutagenesis process described
in Example 3, introduced a stop codon into framework 4
of the TF8HCDR1 gene. This substitution was corrected
by site-directed mutagenesis as described above.
Thorough sequencing of the variable region confirmed
that the correction was made with no additional changes
introduced. Upon transfection of this corrected
TF8HCDR1 gene with the chimeric LC, reasonable
expression levels were obtained.
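
An in-frame stop codon of the kind described above can be
detected by scanning a coding sequence codon by codon. The
following Python sketch shows one way to do this. The
function name and the short toy sequences are assumptions
for illustration; they are not the TF8HCDR1 sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def first_premature_stop(cds: str):
    """Return the 1-based codon number of the first in-frame stop codon that
    occurs before the final codon, or None if there is none."""
    seq = cds.upper()
    n_codons = len(seq) // 3
    for i in range(n_codons - 1):
        if seq[3 * i:3 * i + 3] in STOP_CODONS:
            return i + 1
    return None


if __name__ == "__main__":
    # Hypothetical coding sequences differing by a single base (TAC -> TAA).
    intended = "ATGGCCTACGGGTGA"
    mutated = "ATGGCCTAAGGGTGA"
    print(first_premature_stop(intended))
    print(first_premature_stop(mutated))
```

In the toy pair, a single base change converts a tyrosine
codon into a stop codon, which truncates translation in the
same way that the unexpected change truncated the heavy
chain and abolished detectable antibody expression.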

COS cells which had been co-transfected with
the CDR-grafted LC expression vector, pSG5TF8LCDR1, and
either the chimeric HC or TF8HCDR1, produced antibody at
reasonable levels. Antibody levels in COS cell
supernatants ranged from 0.5 µg to 10.0 µg per ml.

EXAMPLE 5

Binding of the CDR-Grafted TF8-5G9 to Tissue Factor

An ELISA was used to determine the ability of
the CDR-grafted TF8-5G9 antibody, TF8HCDR1 x TF8LCDR1,
to bind to tissue factor. Tissue factor was immobilized
on a microtiter plate. The test COS cell supernatant,
containing the CDR-grafted antibody, was added to the
well and incubated for one hour at room temperature.
Following three washes with PBS/Tween, a goat anti-human
kappa chain polyclonal antibody conjugated to
horseradish peroxidase was added, incubated for one hour
at room temperature and washed. Substrate for the
horseradish peroxidase was added for detection. The
positive control was the TF8-5G9 chimeric antibody. The
CDR-grafted TF8-5G9 antibody was able to bind to tissue
factor to a degree comparable to the chimeric TF8-5G9
antibody (Figure 3, solid symbols).

The ability of the humanized antibody to
compete with murine TF8-5G9 for binding to tissue factor
was also examined. Varying amounts of COS cell
supernatant containing the test CDR-grafted antibody and
a fixed amount of murine TF8-5G9 were added
simultaneously to wells coated with tissue factor.
Binding was allowed to occur for one hour at room
temperature. The wells were washed three times with
PBS/Tween. A goat anti-human kappa chain antibody
conjugated to horseradish peroxidase was added,
incubated for one hour at room temperature and washed.
Substrate for the horseradish peroxidase was added for
detection. The CDR-grafted antibody competed as well as
the chimeric antibody with murine TF8-5G9 for binding to
TF.

These data indicate that the initially
designed CDR-grafted antibody, TF8HCDR1 x TF8LCDR1, was
approximately as active as the chimeric TF8-5G9 in
binding to TF and competing with the murine antibody for
binding to TF.

EXAMPLE 6

Construction and Characterization
of Additional CDR-Grafted Heavy Chains

Upon examination of the molecular structure of
murine TF8-5G9, framework residues at positions 27, 68,
73 and 78 were found to lie on the antibody surface and
had no discernible contact with the CDRs. These
framework residues were of murine sequence in TF8HCDR1
but were changed to the human KOL sequence in various
combinations to generate a series of CDR-grafted heavy
chains with framework residue variations. The changes
were made by the process of site-directed mutagenesis as
described in Example 3. Each CDR-grafted heavy chain
version was expressed in COS cells in combination with
the CDR-grafted LC, TF8LCDR1, and tested for its ability
to bind TF and compete with murine TF8-5G9 for binding.
Every version of the CDR-grafted heavy chain in
combination with TF8LCDR1 was shown to bind TF with an
affinity comparable to chimeric TF8-5G9. Every
CDR-grafted HC in combination with TF8LCDR1 was able to
compete with murine TF8-5G9 for binding to TF to a
degree comparable to the chimeric antibody.

Changes in sequence from murine to human for
HC framework positions 6, 7, 68, 73 and 78 did not
adversely affect the antigen binding ability of the
antibody. The CDR-grafted HC version which had human
sequence at all of these positions, and thus was the
most humanized HC, was TF8HCDR20.

The complete sequence of the TF8HCDR20 gene
was determined. The DNA sequence is shown as a 2360 bp
EcoRI/BamHI insert with protein translation in the
pEe6TF8HCDR20 expression vector in Figure 4 and SEQ ID
NO: 15.

The essential regions of the gene are as
follows:

Nucleotide #    Region

1-6             5' EcoRI restriction site
7-15            Kozak sequence
16-72           Start codon and leader sequence
73-423          CDR-grafted variable region
424-717         Human IgG4 CH1 domain
718-1110        Human IgG4 intron 2
1111-1146       Human IgG4 hinge
1147-1267       Human IgG4 intron 3
1268-1594       Human IgG4 CH2 domain
1595-1691       Human IgG4 intron 4
1692-2012       Human IgG4 CH3 domain
2013-2354       3' untranslated region
2355-2360       3' BamHI end spliced to BclI
                site of the expression vector
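
The coordinates in the region table above can be checked for gaps and overlaps with a short script. The following is a minimal sketch, not part of the original disclosure: the names HC_REGIONS and check_contiguous are hypothetical, and the coordinates are simply transcribed from the table.

```python
# Region map of the TF8HCDR20 insert, transcribed from the table above.
# Coordinates are 1-based and inclusive, as in the text.
HC_REGIONS = [
    (1, 6, "5' EcoRI restriction site"),
    (7, 15, "Kozak sequence"),
    (16, 72, "Start codon and leader sequence"),
    (73, 423, "CDR-grafted variable region"),
    (424, 717, "Human IgG4 CH1 domain"),
    (718, 1110, "Human IgG4 intron 2"),
    (1111, 1146, "Human IgG4 hinge"),
    (1147, 1267, "Human IgG4 intron 3"),
    (1268, 1594, "Human IgG4 CH2 domain"),
    (1595, 1691, "Human IgG4 intron 4"),
    (1692, 2012, "Human IgG4 CH3 domain"),
    (2013, 2354, "3' untranslated region"),
    (2355, 2360, "3' BamHI end spliced to BclI site"),
]


def check_contiguous(regions, total_length):
    """Return {region name: length in bp}; raise if regions gap or overlap."""
    expected_start = 1
    lengths = {}
    for start, end, name in regions:
        if start != expected_start:
            raise ValueError(f"{name}: expected start {expected_start}, got {start}")
        lengths[name] = end - start + 1
        expected_start = end + 1
    if expected_start - 1 != total_length:
        raise ValueError(f"regions end at {expected_start - 1}, not {total_length}")
    return lengths


lengths = check_contiguous(HC_REGIONS, 2360)
for name, n_bp in lengths.items():
    print(f"{n_bp:5d} bp  {name}")
```

Under this transcription the regions tile the 2360 bp insert without gaps, and the variable region spans 351 bp, a whole number of codons.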
EXAMPLE 7

Construction and Characterization
of Additional CDR-Grafted Light Chains

The initially designed CDR-grafted LC,
TF8LCDR1, contained four framework residues from the
murine TF8-5G9 sequence. At two of these positions, 39
and 105, the human REI framework sequence is unique to
REI; however, the murine TF8-5G9 LC sequence is in
agreement with the human consensus sequence. The other
two murine framework residues, trp41 and thr46, are
unique to TF8-5G9. Several versions of the CDR-grafted
LC were generated in which the sequence at these four
positions was changed from the murine to the human REI
in various combinations. These changes were made by
site-directed mutagenesis. Each version of the
CDR-grafted LC was expressed in COS cells in combination
with the CDR-grafted HC, TF8HCDR20, and tested for
ability to bind tissue factor and compete with murine
TF8-5G9 for binding. Every version of the CDR-grafted
LC, in combination with TF8HCDR20, was shown to bind TF
with an affinity comparable to TF8-5G9. Also every
CDR-grafted LC version, in combination with TF8HCDR20, was
able to compete with murine TF8-5G9 for binding to TF in
a manner comparable to the chimeric TF8-5G9 control.

Changes in sequence from murine to human for
LC framework positions 39, 41, 46 and 105 did not
adversely affect the ability of the antibody to
recognize antigen. The CDR-grafted LC of choice was
TF8LCDR3, where murine TF8-5G9 sequence was used at
positions 39 and 105 because these are in agreement with
the human consensus sequence. The preferred CDR-grafted
TF8-5G9 antibody is TF8HCDR20 x TF8LCDR3.

The complete sequence of the TF8LCDR3 gene was
determined and is shown as a 759 bp EcoRI-BamHI insert
with protein translation in the pEe12TF8LCDR3 expression
vector in Figure 5 and SEQ ID NO: 17. The essential
regions of the gene are as follows:

Nucleotide #    Region

1-5             5' EcoRI restriction site
6-8             Kozak sequence
9-68            Start codon and leader sequence
69-392          CDR-grafted variable region
393-710         Human kappa constant region
711-753         3' untranslated region
754-759         3' BamHI end spliced to BclI
                site of the expression vector

EXAMPLE 8

CDR-Grafted TF8-5G9 Antibody TF8HCDR20 x TF8LCDR3
Inhibits Human Tissue Factor

The binding of the CDR-grafted TF8-5G9
antibody, TF8HCDR20 x TF8LCDR3, to TF was assessed as
described in Example 5 and was found to be comparable to
that of the chimeric TF8-5G9 as illustrated in Figure 6.
The ability of the CDR-grafted TF8-5G9 to compete with
the murine antibody for binding to TF is comparable to
that of the chimeric TF8-5G9 as shown in Figure 7.

An in vitro assay was used to measure the
level of inhibition of factor X activation by the
CDR-grafted TF8-5G9 antibody. In this assay, TF forms an
active proteolytic complex with factor VII. This
complex then converts factor X to factor Xa by
proteolysis. The activated Xa enzymatically cleaves a
substrate, Spectrozyme FXa, which releases a chromogen.
The level of chromogen, as detected by optical density,
is an indication of factor X activation due to TF-factor
VIIa activity.

The following reaction mixtures were prepared
in 12 x 75 mm borosilicate glass tubes.

25 µl TBS (50 mM Tris, pH 7.4, 150 mM NaCl)
15 µl 20 mM CaCl2/1% bovine serum albumin
      (BSA)
20 µl human placental tissue factor solution
      (prepared by reconstituting one vial of
      Thromborel S, Curtin Matheson Scientific
      #269-338 with 4.0 ml dH2O and diluting
      1:10 in TBS)
30 µl Factor VII (Enzyme Research Labs #HFVII
      1007 at 237.66 ng/ml in TBS)
30 µl TBS or TF8-5G9 or TF8HCDR20 x TF8LCDR3
      at 1.18 µg/ml or as indicated in Fig. 8
The reaction mixtures were incubated at 37 °C
for ten minutes before the addition of Factor X. (In
some cases the reaction mixture was preincubated for
five minutes before addition of Factor VII or antibody,
followed by a ten minute incubation before addition of
Factor X.) Thirty µl of Factor X solution (Enzyme
Research Labs, DHFX 330, 247.38 µg/ml TBS) was added and
the mixture was incubated at 37 °C for three minutes.
Factor X activation was terminated by pipetting 40 µl of
reaction mixture into 160 µl of stop buffer (50 mM Tris,
pH 7.4, 100 mM EDTA, 150 mM NaCl) in 96 well microtiter
plates. Each tube of reaction mixture was pipetted into
three microtiter wells. Fifty µl of Spectrozyme FXa
substrate (American Diagnostics #222 - 1 µM/ml TBS) was
added to each well. OD405 was read on a Molecular
Devices kinetic plate reader with readings taken every
twenty seconds for ten minutes. Factor X activity was
recorded as mOD/minute, and enzyme velocities over the
linear portion of the reaction curve were compared to
determine inhibition of factor X activation by the
anti-TF antibodies.
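
The rate comparison described above can be sketched as follows. This is an illustrative calculation only: the function names and the OD405 traces are hypothetical (synthetic, perfectly linear data), and a least-squares slope is assumed as the way of taking the velocity over the linear portion, which the text does not specify.

```python
def least_squares_slope(xs, ys):
    """Ordinary least-squares slope of ys against xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


def velocity_mod_per_min(times_s, od405, window=None):
    """Rate in mOD/minute over an index window (start, stop).

    The window stands in for the linear portion of the reaction curve;
    by default the whole trace is used.
    """
    start, stop = window if window is not None else (0, len(times_s))
    slope_od_per_s = least_squares_slope(times_s[start:stop], od405[start:stop])
    return slope_od_per_s * 1000.0 * 60.0  # OD/s -> mOD/min


def percent_inhibition(v_test, v_control):
    """Inhibition of factor X activation relative to the no-antibody control."""
    return 100.0 * (1.0 - v_test / v_control)


# Readings every twenty seconds for ten minutes, as in the text (31 points).
times_s = [20 * i for i in range(31)]
# Hypothetical traces, not data from the patent.
control_od = [0.050 + 0.0005 * t for t in times_s]
antibody_od = [0.050 + 0.0001 * t for t in times_s]

v_control = velocity_mod_per_min(times_s, control_od)
v_antibody = velocity_mod_per_min(times_s, antibody_od)
inhibition = percent_inhibition(v_antibody, v_control)
print(f"control {v_control:.1f} mOD/min, antibody {v_antibody:.1f} mOD/min, "
      f"inhibition {inhibition:.0f}%")
```

With real plate-reader data the window argument would be set to the readings that fall on the linear part of each curve before the velocities are compared.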

As shown in Figure 8, the CDR-grafted TF8-5G9
antibody is approximately as effective as the murine
TF8-5G9 in inhibiting factor X activation. This
indicates that the CDR-grafted TF8-5G9 is functionally
active.

EXAMPLE 9

Construction of the CDR-Grafted Heavy
and Light Chain Myeloma Expression Vectors

For the purpose of establishing a permanent
CDR-grafted antibody-producing cell line, the TF8HCDR20
and TF8LCDR3 genes were subcloned into myeloma cell
expression vectors. The heavy chain TF8HCDR20 was
subcloned into the EcoRI and BclI sites of the
pEe6hCMV-BglII myeloma expression vector described by
Stephens et al. (1989) Nucleic Acids Res. 17:7110 to
produce pEe6TF8HCDR20. The light chain TF8LCDR3 was
subcloned into the EcoRI and BclI sites of the pEe12
myeloma expression vector to produce pEe12TF8LCDR3. The
heavy and light chain expression vectors are illustrated
in Figures 9 and 10, respectively. In both vectors
antibody gene transcription was driven by the human
cytomegalovirus (hCMV) promoter-enhancer, which lies
directly 5' to the multiple cloning site. The
polyadenylation signal sequence lies 3' to the multiple
cloning site and signals the termination of
transcription. Each vector contains the β-lactamase
gene to allow for ampicillin selection in E. coli. The
pEe12 vector contains a glutamine synthetase cDNA gene
under the transcriptional control of the SV40 early
promoter. Glutamine synthetase allows for myeloma cell
transfectants to be selected in glutamine-free media.
Myeloma cells are devoid of glutamine synthetase
activity and are dependent on a supply of glutamine in
the culture media. Cells which have been transfected
with the pEe12 vector, containing the glutamine
synthetase gene, are able to synthesize glutamine from
glutamate and can survive in the absence of glutamine.

The pEe6TF8HCDR20 expression vector is a 7073 bp
plasmid whose DNA sequence is shown in Figure 4 and SEQ
ID NO: 15. The coding regions of the TF8HCDR20 gene are
translated. The essential regions of this vector are
described below:

1. Nucleotides #1-2360: The TF8HCDR20
   CDR-grafted HC gene is described in Example
   6. The HC gene was inserted as an
   EcoRI/BamHI fragment into the EcoRI/BclI
   sites of the pEe6hCMV-BglII vector.

2. Nucleotides #2361-2593: This region
   encodes the SV40 early gene
   polyadenylation signal (SV40 nucleotides
   2770-2537), which acts as a
   transcriptional terminator. This
   fragment is flanked by a 5' BclI site and
   a 3' BamHI site. The 3' BamHI end of the
   heavy chain gene was spliced to the 5'
   BclI site of the polyadenylation signal,
   thus eliminating both sites.

3. Nucleotides #2594-3848: This region is a
   BamHI-BglI fragment from pBR328
   (nucleotides 375-2422) but with a
   deletion between the SalI and AvaI sites
   (pBR328 nucleotides 651-1425) following
   the addition of a SalI linker to the AvaI
   site. This region contains the ColE1
   bacterial origin of replication.

4. Nucleotides #3849-4327: This is a
   BglI-XmnI fragment from the β-lactamase
   gene of pSP64 (Promega Corporation,
   Madison, WI). This gene provides
   ampicillin resistance to bacteria
   transformed with this vector.

5. Nucleotides #4328-4885: This is an
   XmnI-HindIII fragment of the ColE1 based
   plasmid pCT54 described by Emtage et al.
   (1983) Proc. Natl. Acad. Sci. USA
   80:3671. The HindIII site was converted
   to a BglII site by the addition of a
   linker following the addition of the hCMV
   promoter described below.

6. Nucleotides #4886-7022: These
   nucleotides encode the Pst-1m fragment of
   human cytomegalovirus (hCMV) strain AD169
   described by Greenway et al. (1982) Gene
   18:355 containing the region coding for
   the hCMV major immediate early
   promoter. This Pst-1m fragment was
   cloned into the HindIII site of pEe6hCMV
   by addition of oligonucleotides of the
   following sequence to either end of the
   fragment:

   5'     GTCACCGTCCTTGACACGA     3'
   3' ACGTCAGTGGCAGGAACTGTGCTTCGA 5'

   The resulting 2100 bp fragment was
   inserted such that the promoter directed
   transcription towards the EcoRI site of
   pEe6hCMV. The oligonucleotide above
   served to recreate the complete 5'
   untranslated sequence of the hCMV-MIE
   gene the added irrelevant sequence at the
   very 5' end of the fragment. The HindIII
   site at the 5' end was subsequently
   converted to a BglII site by the addition
   of a further linker.

7. Nucleotides #7023-7073: The pSP64
   polylinker with the BamHI and SalI sites
   removed.
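
The oligonucleotide pair given under region 6 can be checked for base pairing and for the single-stranded ends it leaves. The sketch below is illustrative and not part of the original disclosure; the variable and function names are hypothetical, and the two strands are transcribed from the text, the lower one being written 3' to 5' there.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


# Upper strand, written 5'->3' in the text.
top = "GTCACCGTCCTTGACACGA"
# Lower strand, written 3'->5' in the text; reverse it to read 5'->3'.
bottom_3to5 = "ACGTCAGTGGCAGGAACTGTGCTTCGA"
bottom = bottom_3to5[::-1]

# Locate the part of the lower strand that pairs with the upper strand.
paired = reverse_complement(top)
idx = bottom.find(paired)
if idx < 0:
    raise ValueError("strands are not complementary")

# Whatever flanks the paired region on the lower strand is single-stranded.
five_prime_overhang = bottom[:idx]
three_prime_overhang = bottom[idx + len(paired):]

print("lower strand 5'->3':", bottom)
print("5' overhang:", five_prime_overhang)   # AGCT
print("3' overhang:", three_prime_overhang)  # TGCA
```

The 5' AGCT extension matches the overhang left by HindIII and the 3' TGCA extension matches the overhang left by PstI, which is consistent with an adaptor joining a Pst fragment to a HindIII site.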

The pEe12TF8LCDR3 expression vector is a 7864
bp plasmid whose DNA sequence is shown in Figure 5 and
SEQ ID NO: 17. The coding regions of the TF8LCDR3 gene
are translated. The essential regions of this
expression vector are described below:

1. Nucleotides #1-759: The TF8LCDR3
   CDR-grafted LC gene is described in Example
   7. The gene was inserted as an
   EcoRI/BamHI fragment into the EcoRI/BclI
   sites of the pEe12 expression vector.

2. Nucleotides #760-3284: These regions of
   pEe12 are identical to the regions
   encoded by nucleotides 2361-4885 of the
   pEe6TF8HCDR20 vector described above
   (regions #2-5).

3. Nucleotides #3285-5736: This region
   encodes the Chinese hamster ovary
   glutamine synthetase cDNA under the
   transcriptional control of the SV40 early
   promoter and followed by the SV40
   polyadenylation and splice signals from
   the pSV2.dhfr vector described by
   Subramani et al. (1981) Mol. Cell. Biol.
   1:854. The following describes the
   derivation of this region: A 1200 bp
   NaeI-PvuII fragment, containing a
   complete GS coding sequence, was excised
   from the Chinese hamster ovary cDNA clone
   λGS1.1 described by Hayward et al. (1986)
   Nucleic Acid Res. 14:999. After addition
   of a HindIII linker to the NaeI site and
   a BglII linker to the PvuII site (hence
   destroying the NaeI and PvuII sites), the
   1200 bp fragment was cloned in place of
   DHFR sequences in pSV2.dhfr between the
   HindIII and BglII sites to form pSV2.GS.
   The single remaining PvuII site in
   pSV2.GS was converted to a BamHI site
   by addition of an oligonucleotide linker
   to form pSV2BamGS. An EcoRI site in the
   GS cDNA was destroyed by site directed
   mutagenesis without altering the amino
   acid sequence in pSV2BamGS and the
   HindIII site was destroyed by filling in
   with DNA polymerase I. The 2451 bp BamHI
   fragment from this plasmid, containing
   the complete SV40-GS hybrid transcription
   unit, was excised and inserted at the
   BglII site of pEe6hCMV-BglII such that
   transcription from the SV40 early
   promoter proceeds towards the hCMV
   promoter.

4. Nucleotides #5737-7864: This region is
   identical to the hCMV promoter and pSP64
   polylinker encoded by nucleotides 4886-
   7073 of the pEe6TF8HCDR20 vector
   described above (regions 6 and 7).

For the purpose of ensuring that both the
pEe6TF8HCDR20 and pEe12TF8LCDR3 vectors co-transfected
myeloma cells, the vectors were joined in linear
concatamers. Both the pEe6TF8HCDR20 and pEe12TF8LCDR3
vectors were digested at the unique SalI site. The SalI
linearized pEe6TF8HCDR20 vector was phosphatased at its
5' ends to prohibit ligation of two pEe6TF8HCDR20
vectors onto each other. This phosphatased HC vector
was ligated in a 2:1 molar ratio to the SalI linearized
pEe12TF8LCDR3. The resulting concatamers were most
likely of the following composition:

SalI              SalI              SalI              SalI
 |  pEe6TF8HCDR20  |  pEe12TF8LCDR3  |  pEe6TF8HCDR20  |

This concatamerized DNA was extracted with phenol and
chloroform, and precipitated with ammonium acetate and
ethanol. The DNA precipitate was resuspended in
distilled water to a concentration of 1 µg/µL and used
to transfect myeloma cells.
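
A 2:1 molar ratio of two linear plasmids of different size does not correspond to a 2:1 mass ratio. The sketch below converts the molar ratio into masses using the vector sizes stated above (7073 bp and 7864 bp). The 40 µg total is an assumed figure chosen for illustration, the function name is hypothetical, and the calculation assumes that the mass of double-stranded DNA is proportional to its length.

```python
HC_VECTOR_BP = 7073  # pEe6TF8HCDR20, size from the text
LC_VECTOR_BP = 7864  # pEe12TF8LCDR3, size from the text


def masses_for_molar_ratio(total_ug, size_a_bp, size_b_bp, ratio_a_to_b):
    """Split total_ug between vectors A and B so that moles(A)/moles(B) = ratio.

    For double-stranded DNA, moles are proportional to mass / length,
    so the mass shares are proportional to (ratio * size_a) and size_b.
    """
    weight_a = ratio_a_to_b * size_a_bp
    weight_b = size_b_bp
    mass_a = total_ug * weight_a / (weight_a + weight_b)
    return mass_a, total_ug - mass_a


# Assumed total of 40 ug, for illustration only.
hc_ug, lc_ug = masses_for_molar_ratio(40.0, HC_VECTOR_BP, LC_VECTOR_BP, 2.0)
print(f"HC vector: {hc_ug:.1f} ug, LC vector: {lc_ug:.1f} ug")
```

Under these assumptions roughly 64% of the DNA mass is heavy chain vector and 36% is light chain vector.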
EXAMPLE 10

Development of NSO Expression Cell Lines

Stably transformed cell lines expressing the
humanized TF8-5G9 antibody were prepared by transfecting
CDR-grafted heavy and light chain expression vectors
into NSO mouse myeloma cells. Selection of transfected
cells was carried out using the dominant selectable
marker gene, glutamine synthetase (GS).

The NSO mouse myeloma cell line, obtained from
Celltech, Ltd., is a subclone derived from NS-1 and does
not express intracellular light chains. These cells
were cultured in Dulbecco's modified Eagle's medium
(DMEM) with added glutamine and 10% fetal bovine serum
(FBS). To prepare for transfection, the cells were
harvested in mid-log phase of the growth cycle,
centrifuged for 5 minutes, washed with phosphate
buffered saline (PBS), centrifuged again, and the cell
pellet was resuspended in 2.2 mL of PBS. The final cell
concentration was 2.18 x 10^7/mL. Cells were maintained
on ice during the entire procedure.

The DNA to be transfected (pEe12TF8LCDR3 x
pEe6TF8HCDR20) was prepared as a concatamer as described
in Example 9. The DNA and NSO cells were added to a 0.4
cm BioRad Gene Pulser cuvette in the following order:

 40 µL (40 µg) DNA concatamer
320 µL double distilled water
 40 µL 10x PBS
400 µL NSO cells (8.72 x 10^6 cells)

Transfection was performed by electroporation
following a protocol provided by Celltech, Ltd. In this
procedure, the cells and DNA in PBS buffer were exposed
to a brief, high voltage pulse of electricity causing
transient micropores to form on the cell membrane. DNA
transfer takes place through these openings. To prepare
for electroporation, the suspension of NSO cells and DNA
was gently mixed and incubated on ice for 5 minutes.
The cuvette was placed in a BioRad Gene Pulser and given
2 consecutive electrical pulses at settings of 3 µF
(capacitance) and 1.5V (voltage). Following
electroporation, the cuvette was returned to the ice for
5 minutes. The suspension was then diluted in prewarmed
growth medium and distributed into seven 96-well plates.
Control plates containing cells electroporated without
DNA were also prepared at the same time to measure the
presence of spontaneous mutants. Plates were placed in
a 37 °C incubator with 5% CO2.
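
The cuvette contents listed above can be checked arithmetically. The following sketch is illustrative only: it assumes the cells are in 1x PBS and the DNA is in water, as the preceding text states for each preparation, and the variable names are hypothetical.

```python
# Components added to the cuvette: (volume in uL, PBS strength in "x").
# DNA is in distilled water (0x) and cells are in PBS (assumed 1x).
components = {
    "DNA concatamer": (40.0, 0.0),
    "double distilled water": (320.0, 0.0),
    "10x PBS": (40.0, 10.0),
    "NSO cells": (400.0, 1.0),
}

CELLS_PER_ML = 2.18e7  # final cell concentration of the suspension
DNA_UG = 40.0          # mass of DNA concatamer added

total_ul = sum(vol for vol, _ in components.values())
# Volume-weighted PBS strength of the final mixture.
final_pbs_x = sum(vol * x for vol, x in components.values()) / total_ul
# Cells delivered in the 400 uL aliquot.
cells_added = components["NSO cells"][0] / 1000.0 * CELLS_PER_ML
dna_ug_per_ul = DNA_UG / total_ul

print(f"total volume: {total_ul:.0f} uL")
print(f"final PBS strength: {final_pbs_x:.2f}x")
print(f"cells in cuvette: {cells_added:.3g}")
print(f"DNA concentration: {dna_ug_per_ul:.3f} ug/uL")
```

Under these assumptions the 40 µL of 10x PBS offsets the 360 µL of water and DNA solution, so the 800 µL mixture ends at 1x PBS, and 400 µL of suspension at 2.18 x 10^7/mL delivers the stated 8.72 x 10^6 cells.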

Glutamine synthetase, encoded by the GS gene,
is an enzyme that converts glutamate to glutamine. NSO
cells require glutamine for growth due to inadequate
levels of endogenous GS gene expression. In the DNA
concatamer, this gene is located on the pEe12TF8LCDR3
vector. Transfected cells which incorporate the GS gene
become glutamine-independent. Cells not integrating the
GS gene into their genome would remain
glutamine-dependent and would not survive in glutamine-free
medium. Approximately 18 hours post electroporation,
plates were fed with glutamine-free selection medium
and returned to the incubator until viable colonies
appeared.

Approximately 3 weeks after transfection,
distinct macroscopic colonies were observed. These were
screened for expression of the intact humanized antibody
using the assembly ELISA as described in Example 5.
Tissue culture supernatants from wells containing
colonies were screened at a 1:10 dilution. Positive
wells showing activity greater than the 25 ng/mL
standard were subcultured and expanded for further
analysis.

For selection of high producers, antibody
production was quantitated after a 96 hour growth
period. Tissue culture flasks were seeded with 2 x 10^5
cells/mL in 10 mL of selection medium and incubated at
37 °C, 5% CO2 for 96 hours. At the end of that time
period, an aliquot was taken to determine cell
concentration and antibody titer. Antibody production
was calculated as µg/mL and pg/cell/96 hours. The
highest producers from this transfection were:

Cell Line    µg/mL    pg/cell/96 hour

2B1          26.3     24.3
3E11         27.6     59.9
4G6          30.2     41.9
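
The two columns of the table above are linked by the cell concentration measured at harvest, which the text does not report. The sketch below assumes that pg/cell/96 hours was obtained by dividing the titer by the final cell concentration, and back-calculates the cell density each row would imply. Both the assumption and the function names are illustrative, not taken from the source.

```python
PG_PER_UG = 1.0e6

# (titer in ug/mL, specific productivity in pg/cell/96 hours), from the table.
producers = {
    "2B1": (26.3, 24.3),
    "3E11": (27.6, 59.9),
    "4G6": (30.2, 41.9),
}


def pg_per_cell(titer_ug_per_ml, cells_per_ml):
    """Specific productivity, assuming division by the final cell density."""
    return titer_ug_per_ml * PG_PER_UG / cells_per_ml


def implied_cells_per_ml(titer_ug_per_ml, pg_cell):
    """Cell density that would reproduce the tabulated pg/cell value."""
    return titer_ug_per_ml * PG_PER_UG / pg_cell


implied = {
    line: implied_cells_per_ml(titer, pg) for line, (titer, pg) in producers.items()
}
for line, density in implied.items():
    print(f"{line}: about {density:.2e} cells/mL at harvest (implied)")
```

Under this assumption the three lines would have reached between roughly 4.6 x 10^5 and 1.1 x 10^6 cells/mL after 96 hours from the 2 x 10^5 cells/mL seeding density.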
EXAMPLE 11

CDR Grafted Antibody TF8HCDR20 x TF8LCDR3
Inhibits Tissue Factor In Vivo



CDR grafted antibody TF8HCDR20 x TF8LCDR3 was
compared to murine antibody TF8-5G9 for its ability to
protect rats from experimentally induced disseminated
intravascular coagulation (DIC). In the DIC model, rats
are challenged with human thromboplastin (a crude tissue
extract containing TF activity), resulting in fibrinogen
consumption and death. Pretreatment of rats with
anti-TF antibody was demonstrated to protect rats from
fibrinogen consumption and death as follows.

Human thromboplastin was prepared as described
in U.S. Patent 5,223,427. Saline control or 30 µg/ml of
TF8-5G9 or CDR-grafted antibody was injected through the
tail vein of rats, followed by injection of
thromboplastin equivalent to 200 ng of recombinant TF.
Clotting times were determined at T=0 and T=1 minute as
a measure of fibrinogen concentration. Clotting times
are inversely related to fibrinogen concentration, with
a 60 second clotting time corresponding to an 80%
reduction in fibrinogen concentration. Clotting times of
greater than 60 seconds cannot be accurately measured
and were recorded as 60 seconds.

Survivability and clotting times for three
representative studies are shown below.

                    Survivors

Study    Controls    TF8-5G9    CDR-grafted Ab

1        0/8         5/8        6/8
2        0/8         4/7        7/8
3        0/8         8/8        3/7

Clotting Times (seconds)
Controls

Study #1        Study #2        Study #3
T=0    T=1      T=0    T=1      T=0    T=1

16     >60      18     >60      19     >60
16     >60      18     >60      21     >60
16     >60      18     >60      18     >60
17     >60      18     >60      19     >60
15     >60      16     >60      18     54
16     >60      18     >60      18     >60
16     >60      17     >60      18     >60
16     >60      17     >60      18     >60

Clotting Times (seconds)
Murine TF8-5G9

Study #1        Study #2        Study #3
T=0    T=1      T=0    T=1      T=0    T=1

16     36       18     34       19     28
15     41       18     36       18     29
15     33       18     >60      19     29
15     31       17     >60      18     29
15     >60      18     50       18     28
16     >60      17     34       19     40
16     33       17     34       19     40
16     33       18     31       19     34
16     >60                      19     >60

Clotting Times (seconds)
CDR-grafted TF8-5G9

Study #1        Study #2        Study #3
T=0    T=1      T=0    T=1      T=0    T=1

16     >60      17     >60      21     >60
16     >60      17     33       18     34
16     >60      18     32       17     >60
22     37       18     >60      20     35
16     32       17     32       17     58
15     >60      18     31       18     33
16     >60      17     31       18     31
16     >60      16     32

Twenty-three of the twenty-four control rats
had clotting times of greater than 60 seconds indicating
that virtually all untreated rats were consuming more
than 80% of their fibrinogen. Both the CDR-grafted and
murine antibody treated rats had similar clotting times
at one minute of 44.5 and 40 seconds, respectively.
Further, only six of the murine antibody treated rats
and nine of the CDR-grafted antibody treated rats had
clotting times in excess of 60 seconds. Accordingly,
both the murine and CDR-grafted antibodies were able to
neutralize TF and thus protect rats from fibrinogen
consumption and death.
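
The summary figures for the murine antibody group can be recomputed from the T=1 clotting times tabulated above, treating readings above the limit as 60 seconds as the text describes. This is a minimal sketch: the values are transcribed from the three murine TF8-5G9 studies, and the function and variable names are hypothetical.

```python
CEILING_S = 60.0  # times above 60 s were recorded as 60 s

# T=1 minute clotting times for murine TF8-5G9 treated rats, studies 1 to 3.
murine_t1 = [
    "36", "41", "33", "31", ">60", ">60", "33", "33", ">60",  # study 1
    "34", "36", ">60", ">60", "50", "34", "34", "31",         # study 2
    "28", "29", "29", "29", "28", "40", "40", "34", ">60",    # study 3
]


def parse_clotting_time(entry):
    """Return (seconds, censored); '>60' entries are recorded at the ceiling."""
    entry = entry.strip()
    if entry.startswith(">"):
        return CEILING_S, True
    return float(entry), False


parsed = [parse_clotting_time(e) for e in murine_t1]
mean_s = sum(seconds for seconds, _ in parsed) / len(parsed)
n_censored = sum(1 for _, censored in parsed if censored)

print(f"rats: {len(parsed)}, mean T=1 clotting time: {mean_s:.1f} s")
print(f"rats above 60 s: {n_censored}")
```

With the values as transcribed, the murine group gives a mean of about 40 seconds and six rats above 60 seconds, matching the figures stated for that group.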
SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Joliffe, Linda K.
               Zivin, Robert A.
               Pulito, Virginia L.

(ii) TITLE OF INVENTION: CDR-GRAFTED ANTI-TISSUE FACTOR
ANTIBODIES AND METHODS OF USE THEREOF

(iii) NUMBER OF SEQUENCES: 20

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Scully, Scott, Murphy & Presser
(B) STREET: 400 Garden City Plaza
(C) CITY: Garden City
(D) STATE: New York
(E) COUNTRY: United States
(F) ZIP: 11530

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER:
(B) FILING DATE: 07-JUN-1995
(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: DiGiglio, Frank S.
(B) REGISTRATION NUMBER: 31,346
(C) REFERENCE/DOCKET NUMBER: 9598

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: (516) 742-4343
(B) TELEFAX: (516) 742-4366
(C) TELEX: 230 901 SANS UR

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1489 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 11..1391

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

GGTCCTTACA ATG AAA TGC AGC TGG GTC ATC TTC TTC CTG ATG GCA GTG 49
Met Lys Cys Ser Trp Val Ile Phe Phe Leu Met Ala Val
1 5 10

GTT ACA GGG GTC AAT TCA GAG ATT CAG CTG CAG CAG TCT GGG GCT GAG 97
Val Thr Gly Val Asn Ser Glu Ile Gln Leu Gln Gln Ser Gly Ala Glu
15 20 25

CTT GTG AGG CCA GGG GCC TTA GTC AAG TTG TCC TGC AAA GCT TCT GGC 145
Leu Val Arg Pro Gly Ala Leu Val Lys Leu Ser Cys Lys Ala Ser Gly
30 35 40 45

TTC AAC ATT AAA GAC TAC TAT ATG CAC TGG GTG AAG CAG AGG CCT GAA 193
Phe Asn Ile Lys Asp Tyr Tyr Met His Trp Val Lys Gln Arg Pro Glu
50 55 60

CAG GGC CTG GAG TGG ATT GGA TTG ATT GAT CCT GAG AAT GGT AAT ACT 241
Gln Gly Leu Glu Trp Ile Gly Leu Ile Asp Pro Glu Asn Gly Asn Thr
65 70 75

ATA TAT GAC CCG AAG TTC CAG GGC AAG GCC AGT ATA ACA GCA GAC ACA 289
Ile Tyr Asp Pro Lys Phe Gln Gly Lys Ala Ser Ile Thr Ala Asp Thr
80 85 90

TCC TCC AAC ACA GCC TAC CTG CAG CTC AGC AGC CTG ACA TCT GAG GAC 337
Ser Ser Asn Thr Ala Tyr Leu Gln Leu Ser Ser Leu Thr Ser Glu Asp
95 100 105

ACT GCC GTC TAT TAC TGT GCT AGA GAT AAC TCG TAC TAC TTT GAC TAC 385
Thr Ala Val Tyr Tyr Cys Ala Arg Asp Asn Ser Tyr Tyr Phe Asp Tyr
110 115 120 125

TGG GGC CAA GGC ACC ACT CTC ACA GTC TCC TCA GCC AAA ACG ACA CCC 433
Trp Gly Gln Gly Thr Thr Leu Thr Val Ser Ser Ala Lys Thr Thr Pro
130 135 140

CCA TCT GTC TAT CCA CTG GCC CCT GGA TCT GCT GCC CAA ACT AAC TCC 481
Pro Ser Val Tyr Pro Leu Ala Pro Gly Ser Ala Ala Gln Thr Asn Ser
145 150 155

ATG GTG ACC CTG GGA TGC CTG GTC AAG GGC TAT TTC CCT GAG CCA GTG 529
Met Val Thr Leu Gly Cys Leu Val Lys Gly Tyr Phe Pro Glu Pro Val
160 165 170

ACA GTG ACC TGG AAC TCT GGA TCC CTG TCC AGC GGT GTG CAC ACC TTC 577
Thr Val Thr Trp Asn Ser Gly Ser Leu Ser Ser Gly Val His Thr Phe
175 180 185

CCA GCT GTC CTG CAG TCT GAC CTC TAC ACT CTG AGC AGC TCA GTG ACT 625
Pro Ala Val Leu Gln Ser Asp Leu Tyr Thr Leu Ser Ser Ser Val Thr
190 195 200 205

GTG CCC TCC AGC ACC TGG CCC AGC GAG ACC GTC ACC TGC AAC GTT GCC 673
Val Pro Ser Ser Thr Trp Pro Ser Glu Thr Val Thr Cys Asn Val Ala
210 215 220

CAC CCG GCC AGC AGC ACC AAG GTG GAC AAG AAA ATT GTG CCC AGG GAT 721
His Pro Ala Ser Ser Thr Lys Val Asp Lys Lys Ile Val Pro Arg Asp
225 230 235

TGT GGT TGT AAG CCT TGC ATA TGT ACA GTC CCA GAA GTA TCA TCT GTC 769
Cys Gly Cys Lys Pro Cys Ile Cys Thr Val Pro Glu Val Ser Ser Val
240 245 250

TTC ATC TTC CCC CCA AAG CCC AAG GAT GTG CTC ACC ATT ACT CTG ACT 817
Phe Ile Phe Pro Pro Lys Pro Lys Asp Val Leu Thr Ile Thr Leu Thr
255 260 265

CCT AAG GTC ACG TGT GTT GTG GTA GAC ATC AGC AAG GAT GAT CCC GAG 865
Pro Lys Val Thr Cys Val Val Val Asp Ile Ser Lys Asp Asp Pro Glu
270 275 280 285

GTC CAG TTC AGC TGG TTT GTA GAT GAT GTG GAG GTG CAC ACA GCT CAG 913
Val Gln Phe Ser Trp Phe Val Asp Asp Val Glu Val His Thr Ala Gln
290 295 300

ACG CAA CCC CGG GAG GAG CAG TTC AAC AGC ACT TTC CGC TCA GTC AGT 961
Thr Gln Pro Arg Glu Glu Gln Phe Asn Ser Thr Phe Arg Ser Val Ser
305 310 315

GAA CTT CCC ATC ATG CAC CAG GAC TGG CTC AAT GGC AAG GAG TTC AAA 1009
Glu Leu Pro Ile Met His Gln Asp Trp Leu Asn Gly Lys Glu Phe Lys
320 325 330

TGC AGG GTC AAC AGT GCA GCT TTC CCT GCC CCC ATC GAG AAA ACC ATC 1057
Cys Arg Val Asn Ser Ala Ala Phe Pro Ala Pro Ile Glu Lys Thr Ile
335 340 345

TCC AAA ACC AAA GGC AGA CCG AAG GCT CCA CAG GTG TAC ACC ATT CCA 1105
Ser Lys Thr Lys Gly Arg Pro Lys Ala Pro Gln Val Tyr Thr Ile Pro
350 355 360 365

CCT CCC AAG GAG CAG ATG GCC AAG GAT AAA GTC AGT CTG AAC TGC ATG 1153
Pro Pro Lys Glu Gln Met Ala Lys Asp Lys Val Ser Leu Asn Cys Met
370 375 380

ATA ACA GAC TTC TTC CCT GAA GAC ATT ACT GTG GAG TGG CAG TGG AAT 1201
Ile Thr Asp Phe Phe Pro Glu Asp Ile Thr Val Glu Trp Gln Trp Asn
385 390 395

GGG CAG CCA GCG GAG AAC TAC AAG AAC ACT CAG CCC ATC ATG GAC ACA 1249
Gly Gln Pro Ala Glu Asn Tyr Lys Asn Thr Gln Pro Ile Met Asp Thr
400 405 410

GAT GGC TCT TAC TTC GTC TAC AGC AAG CTC AAT GTG CAG AAG AGC AAC 1297
Asp Gly Ser Tyr Phe Val Tyr Ser Lys Leu Asn Val Gln Lys Ser Asn
415 420 425

TGG GAG GCA GGA AAT ACT TTC ACC TGC TCT GTG TTA CAT GAG GGC CTG 1345
Trp Glu Ala Gly Asn Thr Phe Thr Cys Ser Val Leu His Glu Gly Leu
430 435 440 445

CAC AAC CAC CAT ACT GAG AAG AGC CTC TCC CAC TCT CCT GGT AAA T 1391
His Asn His His Thr Glu Lys Ser Leu Ser His Ser Pro Gly Lys
450 455 460

GATCCCAGTG TCCTTGGAGC CCTCTGGTCC TACAGGACTC TGACACCTAC CTCCACCCCT 1451
CCCTGTATAA ATAAAGCACC CAGCACTGCC TTGGACCC 1489



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 460 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Lys Cys Ser Trp Val Ile Phe Phe Leu Met Ala Val Val Thr Gly
1 5 10 15

Val Asn Ser Glu Ile Gln Leu Gln Gln Ser Gly Ala Glu Leu Val Arg
20 25 30

Pro Gly Ala Leu Val Lys Leu Ser Cys Lys Ala Ser Gly Phe Asn Ile
35 40 45

Lys Asp Tyr Tyr Met His Trp Val Lys Gln Arg Pro Glu Gln Gly Leu
50 55 60

Glu Trp Ile Gly Leu Ile Asp Pro Glu Asn Gly Asn Thr Ile Tyr Asp
65 70 75 80

Pro Lys Phe Gln Gly Lys Ala Ser Ile Thr Ala Asp Thr Ser Ser Asn
85 90 95

Thr Ala Tyr Leu Gln Leu Ser Ser Leu Thr Ser Glu Asp Thr Ala Val
100 105 110

Tyr Tyr Cys Ala Arg Asp Asn Ser Tyr Tyr Phe Asp Tyr Trp Gly Gln
115 120 125

Gly Thr Thr Leu Thr Val Ser Ser Ala Lys Thr Thr Pro Pro Ser Val
130 135 140

Tyr Pro Leu Ala Pro Gly Ser Ala Ala Gln Thr Asn Ser Met Val Thr
145 150 155 160

Leu Gly Cys Leu Val Lys Gly Tyr Phe Pro Glu Pro Val Thr Val Thr
165 170 175

Trp Asn Ser Gly Ser Leu Ser Ser Gly Val His Thr Phe Pro Ala Val
180 185 190

Leu Gln Ser Asp Leu Tyr Thr Leu Ser Ser Ser Val Thr Val Pro Ser
195 200 205

Ser Thr Trp Pro Ser Glu Thr Val Thr Cys Asn Val Ala His Pro Ala
210 215 220

Ser Ser Thr Lys Val Asp Lys Lys Ile Val Pro Arg Asp Cys Gly Cys
225 230 235 240

Lys Pro Cys Ile Cys Thr Val Pro Glu Val Ser Ser Val Phe Ile Phe
245 250 255

Pro Pro Lys Pro Lys Asp Val Leu Thr Ile Thr Leu Thr Pro Lys Val
260 265 270

Thr Cys Val Val Val Asp Ile Ser Lys Asp Asp Pro Glu Val Gln Phe
275 280 285

Ser Trp Phe Val Asp Asp Val Glu Val His Thr Ala Gln Thr Gln Pro
290 295 300

Arg Glu Glu Gln Phe Asn Ser Thr Phe Arg Ser Val Ser Glu Leu Pro
305 310 315 320

Ile Met His Gln Asp Trp Leu Asn Gly Lys Glu Phe Lys Cys Arg Val
325 330 335

Asn Ser Ala Ala Phe Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Thr
340 345 350

Lys Gly Arg Pro Lys Ala Pro Gln Val Tyr Thr Ile Pro Pro Pro Lys
355 360 365

Glu Gln Met Ala Lys Asp Lys Val Ser Leu Asn Cys Met Ile Thr Asp
370 375 380

Phe Phe Pro Glu Asp Ile Thr Val Glu Trp Gln Trp Asn Gly Gln Pro
385 390 395 400

Ala Glu Asn Tyr Lys Asn Thr Gln Pro Ile Met Asp Thr Asp Gly Ser
405 410 415

Tyr Phe Val Tyr Ser Lys Leu Asn Val Gln Lys Ser Asn Trp Glu Ala
420 425 430

Gly Asn Thr Phe Thr Cys Ser Val Leu His Glu Gly Leu His Asn His
435 440 445

His Thr Glu Lys Ser Leu Ser His Ser Pro Gly Lys
450 455 460

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 937 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 5..706

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

GGAC ATG CGG GCC CCT GCT CAG TTT TTT GGG ATC TTG TTG CTC TGG TTT 49
Met Arg Ala Pro Ala Gln Phe Phe Gly Ile Leu Leu Leu Trp Phe
1 5 10 15

CCA GGT ATC AGA TGT GAC ATC AAG ATG ACC CAG TCT CCA TCC TCC ATG 97
Pro Gly Ile Arg Cys Asp Ile Lys Met Thr Gln Ser Pro Ser Ser Met
20 25 30

TAT GCA TCG CTG GGA GAG AGA GTC ACT ATC ACT TGT AAG GCG AGT CAG 145
Tyr Ala Ser Leu Gly Glu Arg Val Thr Ile Thr Cys Lys Ala Ser Gln
35 40 45

GAC ATT AGA AAG TAT TTA AAC TGG TAC CAG CAG AAA CCA TGG AAA TCT 193
Asp Ile Arg Lys Tyr Leu Asn Trp Tyr Gln Gln Lys Pro Trp Lys Ser
50 55 60

CCT AAG ACC CTG ATC TAT TAT GCA ACA AGC TTG GCA GAT GGG GTC CCA 241
Pro Lys Thr Leu Ile Tyr Tyr Ala Thr Ser Leu Ala Asp Gly Val Pro
65 70 75

TCA AGA TTC AGT GGC AGT GGA TCT GGG CAA GAT TAT TCT CTA ACC ATC 289
Ser Arg Phe Ser Gly Ser Gly Ser Gly Gln Asp Tyr Ser Leu Thr Ile
80 85 90 95

AGC AGC CTG GAG TCT GAC GAT ACA GCA ACT TAT TAC TGT CTA CAA CAT 337
Ser Ser Leu Glu Ser Asp Asp Thr Ala Thr Tyr Tyr Cys Leu Gln His
100 105 110

GGT GAG AGC CCG TAC ACG TTC GGA GGG GGG ACC AAG CTG GAA ATA AAC 385
Gly Glu Ser Pro Tyr Thr Phe Gly Gly Gly Thr Lys Leu Glu Ile Asn
115 120 125

AGG GCT GAT GCT GCA CCA ACT GTA TCC ATC TTC CCA CCA TCC AGT GAG 433
Arg Ala Asp Ala Ala Pro Thr Val Ser Ile Phe Pro Pro Ser Ser Glu
130 135 140

CAG TTA ACA TCT GGA GGT GCC TCA GTC GTG TGC TTC TTG AAC AAC TTC 481
Gln Leu Thr Ser Gly Gly Ala Ser Val Val Cys Phe Leu Asn Asn Phe
145 150 155

TAC CCC AAA GAC ATC AAT GTC AAG TGG AAG ATT GAT GGC AGT GAA CGA 529
Tyr Pro Lys Asp Ile Asn Val Lys Trp Lys Ile Asp Gly Ser Glu Arg
160 165 170 175

CAA AAT GGC GTC CTG AAC AGT TGG ACT GAT CAG GAC AGC AAA GAC AGC 577
Gln Asn Gly Val Leu Asn Ser Trp Thr Asp Gln Asp Ser Lys Asp Ser
180 185 190

ACC TAC AGC ATG AGC AGC ACC CTC ACG TTG ACC AAG GAC GAG TAT GAA 625
Thr Tyr Ser Met Ser Ser Thr Leu Thr Leu Thr Lys Asp Glu Tyr Glu
195 200 205

CGA CAT AAC AGC TAT ACC TGT GAG GCC ACT CAC AAG ACA TCA ACT TCA 673
Arg His Asn Ser Tyr Thr Cys Glu Ala Thr His Lys Thr Ser Thr Ser
210 215 220

CCC AAT GTC AAG AGC TTC AAC AAG AAT GAG TGT TAGAGACAAA GGTCCTGAGA 726
Pro Asn Val Lys Ser Phe Asn Lys Asn Glu Cys
225 230

CGCCACCACC AGCTCCCCAG CTCCATCCTA TCTTCCCTTC TAAGGTCTTG GAGGCTTCCC 786
CACAAGCGAC CTACCACTGT TGCGGTGCTC CAAACCTCCT CCCCACCTCC TTCTCCTCCT 846
CCTCCCTTTC CTTGGCTTTT ATCATGCTAA TATTTGCAGA AAATATTCAA TAAAGTGAGT 906
CTTTGCACTT GAAAAAAAAA AAAAAAAAAA A 937

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Arg Ala Pro Ala Gln Phe Phe Gly Ile Leu Leu Leu Trp Phe Pro
1 5 10 15

Gly Ile Arg Cys Asp Ile Lys Met Thr Gln Ser Pro Ser Ser Met Tyr
20 25 30

Ala Ser Leu Gly Glu Arg Val Thr Ile Thr Cys Lys Ala Ser Gln Asp
35 40 45

Ile Arg Lys Tyr Leu Asn Trp Tyr Gln Gln Lys Pro Trp Lys Ser Pro
50 55 60

Lys Thr Leu Ile Tyr Tyr Ala Thr Ser Leu Ala Asp Gly Val Pro Ser
65 70 75 80

Arg Phe Ser Gly Ser Gly Ser Gly Gln Asp Tyr Ser Leu Thr Ile Ser
85 90 95

Ser Leu Glu Ser Asp Asp Thr Ala Thr Tyr Tyr Cys Leu Gln His Gly
100 105 110

Glu Ser Pro Tyr Thr Phe Gly Gly Gly Thr Lys Leu Glu Ile Asn Arg
115 120 125

Ala Asp Ala Ala Pro Thr Val Ser Ile Phe Pro Pro Ser Ser Glu Gln
130 135 140

Leu Thr Ser Gly Gly Ala Ser Val Val Cys Phe Leu Asn Asn Phe Tyr
145 150 155 160

Pro Lys Asp Ile Asn Val Lys Trp Lys Ile Asp Gly Ser Glu Arg Gln
165 170 175

Asn Gly Val Leu Asn Ser Trp Thr Asp Gln Asp Ser Lys Asp Ser Thr
180 185 190

Tyr Ser Met Ser Ser Thr Leu Thr Leu Thr Lys Asp Glu Tyr Glu Arg
195 200 205

His Asn Ser Tyr Thr Cys Glu Ala Thr His Lys Thr Ser Thr Ser Pro
210 215 220

Asn Val Lys Ser Phe Asn Lys Asn Glu Cys
225 230

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

Asp Asp Tyr Met His 

1 5



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Leu Ile Asp Pro Glu Asn Gly Asn Thr Ile Tyr Lys Pro Lys Phe Gln
1 5 10 15

Gly

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Asp Asn Ser Tyr Tyr Phe Asp Tyr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Lys Ala Ser Gln Asp Ile Arg Lys Tyr Leu Asn
1 5 10



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Tyr Ala Thr Ser Leu Ala Asp
1 5

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Leu Gln His Gly Glu Ser Pro Tyr Thr
1 5 



(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Gln Val Gln Leu Val Gln Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15

Leu Leu Arg Leu Ser Cys Lys Ala Ser Gly Phe Asn Ile Lys Asp Tyr
20 25 30

Tyr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Ile
35 40 45

Gly Leu Ile Asp Pro Glu Asn Gly Asn Thr Ile Tyr Asp Pro Lys Phe
50 55 60

Gln Gly Arg Phe Ser Ile Ser Ala Asp Thr Ser Lys Asn Thr Ala Phe
65 70 75 80

Leu Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95

Ala Arg Asp Asn Ser Tyr Tyr Phe Asp Tyr Trp Gly Gln Gly Thr Pro
100 105 110

Val Thr Val Ser Ser 
115 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly
1 5 10 15

Asp Arg Val Thr Ile Thr Cys Lys Ala Ser Gln Asp Ile Arg Lys Tyr
20 25 30

Leu Asn Trp Tyr Gln Gln Lys Pro Trp Lys Ala Pro Lys Thr Leu Ile
35 40 45

Tyr Tyr Ala Thr Ser Leu Ala Asp Gly Val Pro Ser Arg Phe Ser Gly
50 55 60

Ser Gly Ser Gly Thr Asp Tyr Thr Phe Thr Ile Ser Ser Leu Gln Pro
65 70 75 80

Glu Asp Ile Ala Thr Tyr Tyr Cys Leu Gln His Gly Glu Ser Pro Tyr
85 90 95

Thr Phe Gly Gln Gly Thr Lys Leu Glu Ile Thr Arg
100 105



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Gln Val Gln Leu Val Glu Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15

Ser Leu Arg Leu Ser Cys Lys Ala Ser Gly Phe Asn Ile Lys Asp Tyr
20 25 30

Tyr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Ile
35 40 45

Gly Leu Ile Asp Pro Glu Asn Gly Asn Thr Ile Tyr Asp Pro Lys Phe
50 55 60

Gln Gly Arg Phe Thr Ile Ser Ala Asp Asn Ser Lys Asn Thr Leu Phe
65 70 75 80

Leu Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95

Ala Arg Asp Asn Ser Tyr Tyr Phe Asp Tyr Trp Gly Gln Gly Thr Pro
100 105 110

Val Thr Val Ser Ser 
115 



(2) INFORMATION FOR SEQ ID NO: 14: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly
1 5 10 15

Asp Arg Val Thr Ile Thr Cys Lys Ala Ser Gln Asp Ile Arg Lys Tyr
20 25 30

Leu Asn Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile
35 40 45

Tyr Tyr Ala Thr Ser Leu Ala Asp Gly Val Pro Ser Arg Phe Ser Gly
50 55 60

Ser Gly Ser Gly Thr Asp Tyr Thr Phe Thr Ile Ser Ser Leu Gln Pro
65 70 75 80

Glu Asp Ile Ala Thr Tyr Tyr Cys Leu Gln His Gly Glu Ser Pro Tyr
85 90 95

Thr Phe Gly Gln Gly Thr Lys Leu Glu Ile Thr Arg
100 105

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7073 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 61..717

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1111..1146

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1268..1594

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1692..2012

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

GAATTCGCCT CCACCATGGA ATGGAGCTGG GTCTTTCTCT TCTTCTTGTC AGTAACTACA 60 

GGT GTA CAC TCA CAA GTT CAG CTG GTG GAG TCT GGA GGA GGA GTA GTA 108 
Gly Val His Ser Gln Val Gln Leu Val Glu Ser Gly Gly Gly Val Val
1 5 10 15

CAA CCT GGA AGG TCA CTG AGA CTG TCT TGT AAG GCT AGT GGA TTC AAT 156
Gln Pro Gly Arg Ser Leu Arg Leu Ser Cys Lys Ala Ser Gly Phe Asn
20 25 30

ATC AAG GAC TAT TAT ATG CAC TGG GTC AGA CAA GCT CCT GGA AAA GGA 204
Ile Lys Asp Tyr Tyr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly
35 40 45

CTC GAG TGG ATA GGT TTA ATT GAT CCT GAG AAT GGT AAC ACG ATA TAT 252
Leu Glu Trp Ile Gly Leu Ile Asp Pro Glu Asn Gly Asn Thr Ile Tyr
50 55 60

GAT CCC AAG TTC CAA GGA AGA TTC ATA ATT TCT GCA GAC AAC TCT AAG 300
Asp Pro Lys Phe Gln Gly Arg Phe Ile Ile Ser Ala Asp Asn Ser Lys
65 70 75 80

AAT ACA CTG TTC CTG CAG ATG GAC TCA CTC AGA CCT GAG GAT ACA GCA 348
Asn Thr Leu Phe Leu Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Ala
85 90 95

GTC TAC TTT TGT GCT AGA GAT AAC AGT TAT TAC TTC GAC TAC TGG GGC 396
Val Tyr Phe Cys Ala Arg Asp Asn Ser Tyr Tyr Phe Asp Tyr Trp Gly
100 105 110

CAA GGA ACA CCA GTC ACC GTG AGC TCA GCT TCC ACC AAG GGC CCA TCC 444
Gln Gly Thr Pro Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser
115 120 125

GTC TTC CCC CTG GCG CCC TGC TCC AGG AGC ACC TCC GAG AGC ACA GCC 492
Val Phe Pro Leu Ala Pro Cys Ser Arg Ser Thr Ser Glu Ser Thr Ala 
130 135 140 






WO 96/40921 






PCT/US96/09287 



GCC CTG GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG 540 
Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val 
145 150 155 160 

TCG TGG AAC TCA GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT 588 
Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala 
165 170 175 

GTC CTA CAG TCC TCA GGA CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG 636 
Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val
180 185 190

CCC TCC AGC AGC TTG GGC ACG AAG ACC TAC ACC TGC AAC GTA GAT CAC 684
Pro Ser Ser Ser Leu Gly Thr Lys Thr Tyr Thr Cys Asn Val Asp His 
195 200 205 



AAG CCC AGC AAC ACC AAG GTG GAC AAG AGA GTT GGTGAGAGGC CAGCACAGGG 737 
Lys Pro Ser Asn Thr Lys Val Asp Lys Arg Val 
210 215 



CAGGGAGGGT GTCTGCTGGA AGCCAGGCTC AGCCCTCCTG CCTGGACGCA CCCCGGCTGT 797
GCAGCCCCAG CCCAGGGCAG CAAGGCATGC CCCATCTGTC TCCTCACCCG GAGGCCTCTG 857
ACCACCCCAC TCATGCTCAG GGAGAGGGTC TTCTGGATTT TTCCACCAGG CTCCGGGCAG 917
CCACAGGCTG GATGCCCCTA CCCCAGGCCC TGCGCATACA GGGGCAGGTG CTGCGCTCAG 977
ACCTGCCAAG AGCCATATCC GGGAGGACCC TGCCCCTGAC CTAAGCCCAC CCCAAAGGCC 1037
AAACTCTCCA CTCCCTCAGC TCAGACACCT TCTCTCCTCC CAGATTCGAG TAACTCCCAA 1097

TCTTCTCTCT GCA GAG TCC AAA TAT GGT CCC CCA TGC CCA TCA TGC CCA 1146
Glu Ser Lys Tyr Gly Pro Pro Cys Pro Ser Cys Pro
1 5 10

GGTAAGCCAA CCCAGGCCTC GCCCTCCAGC TCAAGGCGGG ACAGGTGCCC TAGAGTAGCC 1206
TGCATCCAGG GACAGGCCCC AGCCGGGTGC TGACGCATCC ACCTCCATCT CTTCCTCAGC 1266

A CCT GAG TTC CTG GGG GGA CCA TCA GTC TTC CTG TTC CCC CCA AAA 1312
Pro Glu Phe Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys
1 5 10 15

CCC AAG GAC ACT CTC ATG ATC TCC CGG ACC CCT GAG GTC ACG TGC GTG 1360
Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val
20 25 30






GTG GTG GAC GTG AGC CAG GAA GAC CCC GAG GTC CAG TTC AAC TGG TAC 1408 
Val Val Asp Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr
35 40 45

GTG GAT GGC GTG GAG GTG CAT AAT GCC AAG ACA AAG CCG CGG GAG GAG 1456
Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu
50 55 60

CAG TTC AAC AGC ACG TAC CGT GTG GTC AGC GTC CTC ACC GTC ATG CAC 1504
Gln Phe Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Met His
65 70 75

CAG GAC TGG CTG AAC GGC AAG GAG TAC AAG TGC AAG GTC TCC AAC AAA 1552
Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys
80 85 90 95

GGC CTC CCG TCC TCC ATC GAG AAA ACC ATC TCC AAA GCC AAA 1594
Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys
100 105

GGTGGGACCC ACGGGGTGCG AGGGCCACAT GGACAGAGGT CAGCTCGGCC CACCCTCTGC 1654

CCTGGGAGTG ACCGCTGTGC CAACCTCTGT CCCTACA GGG CAG CCC CGA GAG CCA 1709
Gly Gln Pro Arg Glu Pro
1 5

CAG GTG TAC ACC CTG CCC CCA TCC CAG GAG GAG ATG ACC AAG AAC CAG 1757
Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu Glu Met Thr Lys Asn Gln
10 15 20

GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAC CCC AGC GAC ATC GCC 1805
Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala
25 30 35

GTG GAG TGG GAG AGT AAT GGG CAG CCG GAG AAC AAC TAC AAG ACC ACG 1853
Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr
40 45 50

CCT CCC GTG CTG GAC TCC GAC GGC TCC TTC TTC CTC TAC AGC AGG CTA 1901
Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Arg Leu
55 60 65 70

ACC GTG GAC AAG AGC AGG TGG CAG GAG GGG AAT GTC TTC TCA GTC TCC 1949
Thr Val Asp Lys Ser Arg Trp Gln Glu Gly Asn Val Phe Ser Val Ser
75 80 85

GTG ATG CAT GAG GCT CTG CAC AAC CAC TAC ACA CAG AAG AGC CTC TCC 1997
Val Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser
90 95 100 

CTG TCT CTG GGT AAA TGAGTGCCAG GGCCGGCAAG CCCCCGCTCC CCGGGCTCTC 2052 
Leu Ser Leu Gly Lys 
105 



GGGGTCGCGC GAGGATGCTT GGCACGTACC CCGTCTACAT ACTTCCCAGG CACCCAGCAT 2112
GGAAATAAAG CACCCACCAC TGCCCTGGGC CCCTGTGAGA CTGTGATGGT TCTTTCCACG 2172
GGTCAGGCCG AGTCTGAGGC CTGAGTGACA TGAGGGAGGC AGAGCGGGTC CCACTGTCCC 2232
CACACTGGCC CAGGCTGTGC AGGTGTGCCT GGGCCACCTA GGGTGGGGCT CAGCCAGGGG 2292
CTGCCCTCGG CAGGGTGGGG GATTTGCCAG CGTGGCCCTC CCTCCAGCAG CAGGACTCTA 2352
GAGGATCATA ATCAGCCATA CCACATTTGT AGAGGTTTTA CTTGCTTTAA AAAACCTCCC 2412
ACACCTCCCC CTGAACCTGA AACATAAAAT GAATGCAATT GTTGTTGTTA ACTTGTTTAT 2472
TGCAGCTTAT AATGGTTACA AATAAAGCAA TAGCATCACA AATTTCACAA ATAAAGCATT 2532
TTTTTCACTG CATTCTAGTT GTGGTTTGTC CAAACTCATC AATGTATCTT ATCATGTCTG 2592
GATCCTCTAC GCCGGACGCA TCGTGGCCGG CATCACCGGC GCCACAGGTG CGGTTGCTGG 2652
[illegible] GCCGACATCA CCGATGGGGA AGATCGGGCT CGCCACTTCG GGCTCATGAG 2712
CGCTTGTTTC GGCGTGGGTA TGGTGGCAGG CCCGTGGCCG GGGGACTGTT GGGCGCCATC 2772
TCCTTGCATG CACCATTCCT TGCGGCGGCG GTGCTCAACG GCCTCAACCT ACTACTGGGC 2832
TGCTTCCTAA TGCAGGAGTC GCATAAGGGA GAGCGTCGAC CTCGGGCCGC GTTGCTGGCG 2892
TTTTTCCATA GGCTCCGCCC CCCTGACGAG CATCACAAAA ATCGACGCTC AAGTCAGAGG 2952
TGGCGAAACC CGACAGGACT ATAAAGATAC CAGGCGTTTC CCCCTGGAAG CTCCCTCGTG 3012
CGCTCTCCTG TTCCGACCCT GCCGCTTACC GGATACCTGT CCGCCTTTCT CCCTTCGGGA 3072
AGCGTGGCGC TTTCTCAATG CTCACGCTGT AGGTATCTCA GTTCGGTGTA GGTCGTTCGC 3132
TCCAAGCTGG GCTGTGTGCA CGAACCCCCC GTTCAGCCCG ACCGCTGCGC CTTATCCGGT 3192
AACTATCGTC TTGAGTCCAA CCCGGTAAGA CACGACTTAT CGCCACTGGC AGCAGCCACT 3252
GGTAACAGGA TTAGCAGAGC GAGGTATGTA GGCGGTGCTA CAGAGTTCTT GAAGTGGTGG 3312
CCTAACTACG GCTACACTAG AAGGACAGTA TTTGGTATCT GCGCTCTGCT GAAGCCAGTT 3372
ACCTTCGGAA AAAGAGTTGG TAGCTCTTGA TCCGGCAAAC AAACCACCGC TGGTAGCGGT 3432
GGTTTTTTTG TTTGCAAGCA GCAGATTACG CGCAGAAAAA AAGGATCTCA AGAAGATCCT 3492
TTGATCTTTT CTACGGGGTC TGACGCTCAG TGGAACGAAA ACTCACGTTA AGGGATTTTG 3552
GTCATGAGAT TATCAAAAAG GATCTTCACC TAGATCCTTT TAAATTAAAA [illegible] 3612
AAATCAATCT AAAGTATATA TGAGTAAACT TGGTCTGACA GTTACCAATG [illegible] 3672
GAGGCACCTA TCTCAGCGAT CTGTCTATTT CGTTCATCCA TAGTTGCCTG [illegible] 3732
GTGTAGATAA CTACGATACG GGAGGGCTTA CCATCTGGCC CCAGTGCTGC AATGATACCG 3792
CGAGACCCAC GCTCACCGGC TCCAGATTTA TCAGCAATAA ACCAGCCAGC CGGAAGGGCC 3852
GAGCGCAGAA GTGGTCCTGC AACTTTATCC GCCTCCATCC AGTCTATTAA TTGTTGCCGG 3912
GAAGCTAGAG TAAGTAGTTC GCCAGTTAAT AGTTTGCGCA ACGTTGTTGC CATTGCTACA 3972
GGCATCGTGG TGTCACGCTC GTCGTTTGGT ATGGCATCAT TCAGCTCCGG TTCCCAACGA 4032
TCAAGGCGAG TTACATGATC CCCCATGTTG TGCAAAAAAG [illegible] [illegible] 4092
CCGATCGTTG TCAGAAGTAA GTTGGCCGCA [illegible] [illegible] [illegible] 4152
CATAATTCTC TTACTGTCAT [illegible] [illegible] [illegible] [illegible] 4212
ACCAAGTCAT TCTGAGAATA GTGTATGCGG CGACCGAGTT [illegible] [illegible] 4272
CGGGATAATA CCGCGCGACA TAGCAGAACT TTAAAAGTGC [illegible] [illegible] 4332
TCGGGGCGAA AACTCTCAAG GATCTTACCG CTGTTGAGAT CCAGTTCGAT GTAACCCACT 4392
CGTGCACCCA ACTGATCTTC AGCATCTTTT ACTTTCACCA GCGTTTCTGG GTGAGCAAAA 4452
ACAGGAAGGC AAAATGCCGC AAAAAAGGGA ATAAGGGCGA CACGGAAATG TTGAATACTC 4512
ATACTCTTCC TTTTTCAATA TTATTGAAGC ATTTATCAGG GTTATTGTCT CATGAGCGGA 4572
TACATATTTG AATGTATTTA GAAAAATAAA CAAATAGGGG TTCCGCGCAC ATTTCCCCGA 4632
AAAGTGCCAC CTGACGTCTA AGAAACCATT ATTATCATGA CATTAACCTA TAAAAATAGG 4692
CGTATCACGA GGCCCTGATG GCTCTTTGCG GCACCCATCG TTCGTAATGT TCCGTGGCAC 4752
CGACGACAAC CCTCAAGAGA AAATGTAATC ACACTGGCTC ACCTTCGGGT GGGCCTTTCT 4812
GCGTTTATAA GGAGACACTT TATGTTTAAG AAGGTTGGTA AATTCCTTGC GGCTTTGGCA 4872
GCCAAGCTAG AGATCTCTAG CTTCGTGTCA AGGACGGTGA CTGCAGTGAA TAATAAAATG 4932
TGTGTTTGTC CGAAATACGC GTTTTGAGAT TTCTGTCGCC GACTAAATTC ATGTCGCGCG 4992
ATAGTGGTGT TTATCGCCGA TAGAGATGGC GATATTGGAA AAATCGATAT TTGAAAATAT 5052
GGCATATTGA AAATGTCGCC GATGTGAGTT TCTGTGTAAC TGATATCGCC ATTTTTCCAA 5112
AAGTGATTTT TGGGCATACG CGATATCTGG CGATAGCGCT TATATCGTTT ACGGGGGATG 5172
GCGATAGACG ACTTTGGTGA CTTGGGCGAT TCTGTGTGTC GCAAATATCG [illegible] 5232
ATAGGTGACA GACGATATGA GGCTATATCG CCGATAGAGG CGACATCAAG [illegible] 5292
GCCAATGCAT ATCGATCTAT ACATTGAATC AATATTGGCC ATTAGCCATA [illegible] 5352
GTTATATAGC ATAAATCAAT ATTGGCTATT GGCCATTGCA TACGYTGTAT CCATATCATA 5412
ATATGTACAT TTATATTGGC TCATGTCCAA CATTACCGCC ATGTTGACAT TGATTATTGA 5472
CTAGTTATTA ATAGTAATCA ATTACGGGGT CATTAGTTCA TAGCCCATAT ATGGAGTTCC 5532
GCGTTACATA ACTTACGGTA AATGGCCCGC CTGGCTGACC GCCCAACGAC [illegible] 5592
TGACGTCAAT AATGACGTAT [illegible] [illegible] [illegible] [illegible] 5652
AATGGGTGGA [illegible] TAAACTGCCG [illegible] [illegible] [illegible] 5712
CAAGTACGCC [illegible] [illegible] [illegible] [illegible] [illegible] 5772
ACATGACCTT ATGGGACTTT CCTACTTGGC AGTACATCTA CGTATTAGTC ATCGCTATTA 5832
CCATGGTGAT GCGGTTTTGG CAGTACATCA ATGGGCGTGG ATAGCGGTTT GACTCACGGG 5892
GATTTCCAAG TCTCCACCCC ATTGACGTCA ATGGGAGTTT GTTTTGGGAC CAAAATCAAC 5952
GGGACTTTCC AAAATGTCGT AACAACTCCG CCCCATTGAC GCAAATGGGC GGTAGGCGTG 6012
TACGGTGGGA GGTCTATATA AGCAGAGCTC GTTTAGTGAA CCGTCAGATC GCCTGGAGAC 6072
GCCATCCACG CTGTTTTGAC CTCCATAGAA GACACCGGGA CCGATCCAGC CTCCGCGGCC 6132
GGGAACGGTG CATTGGAACG CGGATTCCCC GTGCCAAGAG TGACGTAAGT ACCGCCTATA 6192
GAGTCTATAG GCCCACCCCC TTGGCTTCTT ATGCATGCTA TACTGTTTTT GGCTTGGGGT 6252
CTATACACCC CCGCTTCCTC ATGTTATAGG TGATGGTATA GCTTAGCCTA TAGGTGTGGG 6312
TTATTGACCA TTATTGACCA CTCCCCTATT GGTGACGATA CTTTCCATTA CTAATCCATA 6372
ACATGGCTCT TTGCCACAAC TCTCTTTATT GGCTATATGC CAATACACTG TCCTTCAGAG 6432
ACTGACACGG ACTCTGTATT TTTACAGGAT GGGGTCTCAT TTATTATTTA CAAATTCACA 6492
TATACAACAC CACCGTCCCC AGTGCCCGCA GTTTTTATTA AACATAACGT GGGATCTCCA 6552
CGCGAATCTC GGGTACGTGT TCCGGACATG GGCTCTTCTC CGGTAGCGGC GGAGCTTCTA 6612
CATCCGAGCC CTGCTCCCAT CCCTCCAGCG ACTCATGGTC GCTCGGCAGC TCCTTGCTCC 6672
TAACAGTGGA GGCCAGACTT AGGCACAGCA CGATGCCCAC CACCACCAGT GTGCCGCACA 6732
AGGCCGTGGC GGTAGGGTAT GTGTCTGAAA ATGAGCTCGG GGAGCGGGCT TGCACCGCTG 6792
ACGCATTTGG AAGACTTAAG GCAGCGGCAG AAGAAGATGC AGGCAGCTGA [illegible] 6852
TCTGATAAGA GTCAGAGGTA ACTCCCGTTG CGGTGCTGTT AACGGTGGAG GGCAGTGTAG 6912
TCTGAGCAGT ACTCGTTGCT GCCGCGCGCG CCACCAGACA TAATAGCTGA CAGACTAACA 6972
GACTGTTCCT TTCCATGGGT CTTTTCTGCA GTCACCGTCC TTGACACGAA GCTTGGGCTG 7032
CAGGTCGATC GACTCTAGAG GATCGATCCC CGGGCGAGCT C 7073



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Gly Val His Ser Gln Val Gln Leu Val Glu Ser Gly Gly Gly Val Val
1 5 10 15

Gln Pro Gly Arg Ser Leu Arg Leu Ser Cys Lys Ala Ser Gly Phe Asn
20 25 30

Ile Lys Asp Tyr Tyr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly
35 40 45

Leu Glu Trp Ile Gly Leu Ile Asp Pro Glu Asn Gly Asn Thr Ile Tyr
50 55 60

Asp Pro Lys Phe Gln Gly Arg Phe Ile Ile Ser Ala Asp Asn Ser Lys
65 70 75 80

Asn Thr Leu Phe Leu Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Ala
85 90 95

Val Tyr Phe Cys Ala Arg Asp Asn Ser Tyr Tyr Phe Asp Tyr Trp Gly
100 105 110

Gln Gly Thr Pro Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser
115 120 125

Val Phe Pro Leu Ala Pro Cys Ser Arg Ser Thr Ser Glu Ser Thr Ala
130 135 140

Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val
145 150 155 160

Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala
165 170 175

Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val
180 185 190

Pro Ser Ser Ser Leu Gly Thr Lys Thr Tyr Thr Cys Asn Val Asp His
195 200 205

Lys Pro Ser Asn Thr Lys Val Asp Lys Arg Val
210 215

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Glu Ser Lys Tyr Gly Pro Pro Cys Pro Ser Cys Pro 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Pro Glu Phe Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro
1 5 10 15

Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val
20 25 30

Val Asp Val Ser Gln Glu Asp Pro Glu Val Gln Phe Asn Trp Tyr Val
35 40 45

Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln
50 55 60

Phe Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Met His Gln
65 70 75 80

Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly
85 90 95

Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys
100 105



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Gln Glu
1 5 10 15

Glu Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe
20 25 30

Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu
35 40 45

Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe
50 55 60

Phe Leu Tyr Ser Arg Leu Thr Val Asp Lys Ser Arg Trp Gln Glu Gly
65 70 75 80

Asn Val Phe Ser Val Ser Val Met His Glu Ala Leu His Asn His Tyr
85 90 95

Thr Gln Lys Ser Leu Ser Leu Ser Leu Gly Lys
100 105



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7864 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 9..711



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



AATTCACCAT 


GGGTGTGCCA 


ACTCAGGTAT 


TAGGATTACT 


GCTGCTGTGG 


CTTACAGATG 


60 


CAAGATGTGA 


TATCCAAATG 


ACACAATCTC 


CTTCTTCTCT 


AAGTGCTTCT 


GTCGGAGATA 


120 


GAGTAACAAT 


TACATGTAAG 


GCGAGTCAGG 


ACATTAGAAA 


GTATTTAAAC 


TGGTATCAGC 


180 


AAAAACCTGG 


GAAGGCTCCT 


AAGCTACTGA 


TTTATTATGC 


AACAAGTTTG 


GCAGATGGAG 


240 


TACCTTCTAG 


ATTTTCTGGT 


TCTGGCTCTG 


GAACAGACTA 


CACATTCACA 


ATTTCTTCTC 


300 


TCCAACCTGA 


GGACATTGCT 


ACATACTACT 


GCCTACAACA 


TGGTGAGAGT 


CCGTATACAT 


360 


TTGGACAAGG 


AACAAAACTA 


GAGATCACAA 


GAACTGTTGC 


GGCGCCGTCT 


GTCTTCATCT 


420 


TCCCGCCATC 


TGATGAGCAG 


TTGAAATCTG 


GAACTGCCTC 


TGTTGTGTGC 


CTGCTGAATA 


480 


ACTTCTATCC 


CAGAGAGGCC 


AAAGTACAGT 


GGAAGGTGGA 


TAACGCCCTC 


CAATCGGGTA 


540 


ACTCCCAGGA 


GAGTGTCACA 


GAGCAGGACA 


GCAAGGACAG 


CACCTACAGC 


CTCAGCAGCA 


600 


CCCTGACGCT 


GAGCAAAGCA 


GACTACGAGA 


AACACAAAGT 


CTACGCCTGC 


GAAGTCACCC 


660 


ATCAGGGCCT 


GAGCTCGCCC 


GTCACAAAGA 


GCTTCAACAG 


GGGAGAGTGT 


TAGAGGGAGA 


720 


AGTGCCCCCA 


CCTGCTCCTC 


AGTTCCAGCC 


TGGGGATCAT 


AATCAGCCAT 


ACCACATTTG 


780 


TAGAGGTTTT 


ACTTGCTTTA 


AAAAACCTCC 


CACACCTCCC 


CCTGAACCTG 


AAACATAAAA 


840 


TGAATGCAAT 


TGTTGTTGTT 


AACTTGTTTA 


TTGCAGCTTA 


TAATGGTTAC 


AAATAAAGCA 


900 


ATAGCATCAC 


AAATTTCACA 


AATAAAGCAT 


TTTTTTCACT 


GCATTCTAGT 


TGTGGTTTGT 


960 


CCAAACTCAT 


CAATGTATCT 


TATCATGTCT 


GGATCCTCTA 


CGCCGGACGC 


ATCGTGGCCG 


1020 


GCATCACCGG 


CGCCACAGGT 


GCGGTTGCTG 


GCGCCTATAT 


CGCCGACATC 


ACCGATGGGG 


1080 


AAGATCGGGC 


TOGCCACTTC 


GGGCTCATGA 


GCGCTTGTTT 


CGGCGTGGGT 


ATGGTGGCAG 


1140 
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GCCCGTGGCC 


GGGGGACTGT 


TGGGCGCCAT 


CTCCTTGCAT 


GCACCATTCC 


TTGCGGCGGC 


1200 


GGTGCTCAAC 


GGCCTCAACC 


TACTACTGGG 


CTGCTTCCTA 


ATGCAGGAGT 


CG CAT AAGGG 


1260 


AGAGCGTCGA 


CCTCGGGCCG 


CGTTGCTGGC 


GTTTTTCCAT 


AGGCTCCGCC 


CCCCTGACGA 


1320 


GCATCACAAA 


AATCGACGCT 


CAAGTCAGAG 


GTGGCGAAAC 


CCGACAGGAC 


TATAAAGATA 


1380 


CCAGGCGTTT 


CCCCCTGGAA 


GCTCCCTCGT 


GCGCTCTCCT 


GTTCCGACCC 


TGCCGCTTAC 


1440 


CGGATACCTG 


TCCGCCTTTC 


TCCCTTCGGG 


AAGCGTGGCG 


CTTTCTCAAT 


GCTCACGCTG 


1500 


TAGGTATCTC 


AGTTCGGTGT 


AGGTCG TTCG 


CTCCAAGCTG 


GGCTGTGTGC 


ACGAACCCCC 


1560 


CGTTCAGCCC 


GACCG CTGCG 


CCTTATCCGG 


TAACTATCGT 


CTTGAGTCCA 


ACCCGGTAAG 


1620 


ACACGACTTA 


TCGCCACTGG 


CAGCAGCCAC 


TGGTAACAGG 


ATTAGCAGAG 


CGAGGTATGT 


1680 


AGGCGGTGCT 


ACAGAGTTCT 


TGAAGTGGTG 


GCCTAACTAC 


GGCTACACTA 


GAAGGACAGT 


1740 


ATTTGGTATC 


TGCGCTCTGC 


TGAAGCCAGT 


TACCTTCGGA 


AAAAGAGTTG 


GTAGCTCTTG 


1800 


ATCCGGCAAA 


CAAACCACCG 


CTGGTAGCGG 


TGGTTTTTTT 


GTTTGCAAGC 


AGCAGATTAC 


1860 


GCGCAGAAAA 


AAAGGATCTC 


AAGAAGATCC 


TTTGATCTTT 


TCTACGGGGT 


CTGACGCTCA 


1920 


GTGGAACGAA 


AACTCACGTT 


AAGGGATTTT 


GGTCATGAGA 


TTATCAAAAA 


GGATCTTCAC 


1980 


CTAGATCCTT 


TTAAATTAAA 


AATGAAGTTT 


TAAATCAATC 


TAAAGTATAT 


ATGAGTAAAC 


2040 


TTGGTCTGAC 


AGTTACCAAT 


GCTTAATCAG 


TGAGGCACCT 


ATCTCAGCGA 




2100 


TCGTTCATCC 


ATAGTTGCCT 


GACTCCCCGT 


CGTGTAGATA 


ACTACGATAC 


GGGAGGGCTT 


2160 


ACCATCTGGC 


CCCAGTG CTG 


CAATGATACC 


GCGAGACCCA 


CGCTCACCGG 


CTCCAGATTT 


2220 


AX WnutvAA X A 


AACCAG CCAG 


CCGG AAGGG C 


CGAGCGCAGA 


AGTGGTCCTG 


CAACTTTATC 


2280 


CGCCTCCATC 


CAGTCTATTA 


ATTGTTGCCG 


GGAAGCTAGA 


GTAAGTAGTT 


CGCCAGTTAA 


2340 


TAGTTTGCGC 


AACGTTGTTG 


CCATTGCTAC 


AGGCATCGTG 


GTGTCACGCT 


CGTCGTTTGG 


2400 


TATGGCTTCA 


TTCAGCTCCG 


GTTCCCAACG 


ATCAAGGCGA 


GTTACATGAT 


CCCCCATGTT 


2460 


GTGCAAAAAA 


GCGGTTAGCT 


CCTTCGGTCC 


TCCGATCGTT 


GTCAGAAGTA 


AGTTGGCCGC 


2520 


AGTGTTATCA 


CTCATGGTTA 


TGGCAGCACT GCATAATTCT CTTACTGTCA TGCCATCCGT 2580
AAGATGCTTT TCTGTGACTG GTGAGTACTC AACCAAGTCA TTCTGAGAAT AGTGTATGCG 2640
GCGACCGAGT TGCTCTTGCC CGGCGTCAAC ACGGGATAAT ACCGCGCCAC ATAGCAGAAC 2700
TTTAAAAGTG CTCATCATTG GAAAACGTTC TTCGGGGCGA AAACTCTCAA GGATCTTACC 2760
GCTGTTGAGA TCCAGTTCGA TGTAACCCAC TCGTGCACCC AACTGATCTT CAGCATCTTT 2820
TACTTTCACC AGCGTTTCTG GGTGAGCAAA AACAGGAAGG CAAAATGCCG CAAAAAAGGG 2880
AATAAGGGCG ACACGGAAAT GTTGAATACT CATACTCTTC CTTTTTCAAT ATTATTGAAG 2940
CATTTATCAG GGTTATTGTC TCATGAGCGG ATACATATTT GAATGTATTT AGAAAAATAA 3000
ACAAATAGGG GTTCCGCGCA CATTTCCCCG AAAAGTGCCA CCTGACGTCT AAGAAACCAT 3060
TATTATCATG ACATTAACCT ATAAAAATAG GCGTATCACG AGGCCCTGAT GGCTCTTTGC 3120
GGCACCCATC GTTCGTAATG TTCCGTGGCA CCGAGGACAA CCCTCAAGAG AAAATGTAAT 3180
CACACTGGCT CACCTTCGGG TGGGCCTTTC TGCGTTTATA AGGAGACACT TTATGTTTAA 3240
GAAGGTTGGT AAATTCCTTG OGGCTTTGGC AGCCAAGCTA GAGATCCGGC TGTGGAATGT 3300
GTGTGAGTTA GGGTGTGGAA AGTCCCCAGG CTCCCCAGCA GGCAGAAGTA TGCAAAGCAT 3360
GCATCTCAAT TAGTCAGCAA CCAGGCTCCC CAGCAGGCAG AAGTATGCAA AGCATGCATC 3420
TCAATTAGTC AGCAACCATA GTCCCGCCCC TAACTCCGCC CATCCCGCCC CTAACTCCGC 3480
CCAGTTCCGC CCATTCTCCG CCCCATGGCT GACTAATTTT [illegible] GCAGAGGCCG 3540
AGGCCGCCTC GGCCTCTGAG CTATTCCAGA AGTAGTGAGG AGGCTTTTTT GGAGGCCTAG 3600
GCTTTTGCAA AAAGCTAGCT TGGGGCCACC GCTCAGAGCA CCTTCCACCA [illegible] 3660
[illegible] [illegible] AAAACATCAA GGAAATGTAC TTGTGCCTGC CCCAGGGTGA 3720
GAAAGTCCAA GCCATGTATA TCTGGGTTGA TGGTACTGGA GAAGGACTGC GCTGCAAAAC 3780
CCGCACCCTG GACTGTGAGC CCAAGTGTGT AGAAGAGTTA CCTGAGTGGA ATTTTGATGG 3840
CTCTAGTACC TTTCAGTCTG AGGGCTCCAA CAGTGAGATG TATCTCAGCC CTGTTGCCAT 3900
GTTTCGGGAC CCCTTCCGCA GAGATCCCAA CAAGCTGGTG TTCTGTGAAG TTTTCAAGTA 3960
CAACCGGAAG CCTGCAGAGA CCAATTTAAG GCACTCGTGT AAACGGATAA TGGACATGGT 4020
GAGCAACCAG CACCCCTGGT TTGGAATGGA ACAGGAGTAT ACTCTGATGG GAACAGATGG 4080


GCACCCTTTT GGTTGGCCTT CCAATGGCTT TCCTGGGCCC CAAGGTCCGT ATTACTGTGG 4140
TGTGGGCGCA GACAAAGCCT ATGGCAGGGA TATCGTGGAG GCTCACTACC GCGCCTGCTT 4200
GTATGCTGGG GTCAAGATTA CAGGAACAAA TGCTGAGGTC ATGCCTGCCC AGTGGGAACT 4260
CCAAATAGGA CCCTGTGAAG GAATCCGCAT GGGAGATCAT CTCTGGGTGG CCCGTTTCAT 4320
CTTNCATCGA GTATGTGAAG ACTTTGGGGT AATAGCAACC TTTGACCCCA AGCCCATTCC 4380
TGGGAACTGG AATGGTGCAG GCTGCCATAC GAACTTTAGC ACCAAGGCCA TGCGGGAGGA 4440
GAATGGTCTG AAGCACATCG AGGAGGCCAT CGAGAAACTA AGCAAGCGGC ACCGGTACCA 4500
CATTCGAGCC TACGATCCCA AGGGGGGCCT GGACAATGCC CGTGGTCTGA CTGGGTTCCA 4560
CGAAACGTCC AACATCAACG ACTTTTCTGC TGGTGTCGCC AATCGCAGTG CCAGCATCCG 4620
CATTCCCCCG ACTGTCGGCC AGGAGAAGAA AGGTTACTTT GAAGACCGCG GCCCCTCTGC 4680
CAATTGTGAC CCCTTTGCAG TGACAGAAGC CATCGTCCGC ACATGCCTTC TCAATGAGAC 4740
TGGCCACGAG CCCTTCCAAT ACAAAAACTA ATTAGACTTT GAGTGATCTT GAGCCTTTCC 4800
TAGTTCATCC CACCCCGCCC CAGAGAGATC TTTGTGAAGG AACCTTACTT CTGTGGTGTG 4860
ACATAATTGG ACAAACTACC TACAGAGATT TAAAGCTCTA AGGTAAATAT AAAATTTTTA 4920
AGTGTATAAT GTGTTAAACT ACTGATTCTA ATTGTTTGTG TATTTTAGAT TCCAACCTAT 4980
GGAACTGATG AATGGGAGCA GTGGTGGAAT GCCTTTAATG AGGAAAACCT GTTTTGCTCA 5040
GAAGAAATGC CATCTAGTGA TGATGAGGCT ACTGCTGACT CTCAACATTC TACTCCTCCA 5100
AAAAAGAAGA GAAAGGTAGA ACACCCCAAG GACTTTCCTT CAGAATTGCT AAGTTTTTTG 5160
AGTCATGCTG TGTTTAGTAA TAGAACTCTT GCTTGCTTTG CTATTTACAC CACAAAGGAA 5220
AAAGCTGCAC TGCTATACAA GAAAATTATG GAAAAATATT CTGTAACCTT TATAAGTAGG 5280
CATAACAGTT ATAATCATAA CATACTGTTT TTTCTTACTC CACACAGGCA TAGAGTGTCT 5340
GCTATTAATA ACTATGCTCA AAAATTGTGT ACCTTTAGCT TTTTAATTTG TAAAGGGGTT 5400
AATAAGGAAT ATTTGATGTA TAGTGCCTAG ACTAGAGATC ATAATCAGCC ATACCACATT 5460
TGTAGAGGTT TTACTTCCTT TAAAAAACCT CCCACACCTC CCCCTGAACC TGAAACATAA 5520


AATGAATGCA ATTGTTGTTG TTAACTTGTT TATTGCAGCT TATAATGGTT ACAAATAAAG 5580
CAATAGCATC ACAAATTTCA CAAATAAAGC ATTTTTTTCA CTGCATTCTA GTTGTGGTTT 5640
GTCCAAACTC ATCAATGTAT CTTATCATGT CTGGATCTCT AGCTTCGTGT CAAGGACGGT 5700
GACTGCAGTG AATAATAAAA TGTGTGTTTG TCCGAAATAC GCGTTTTGAG ATTTCTGTCG 5760
CCTACTAAAT TCATGTCGCG CGATAGTGGT GTTTATCGCC GATAGAGATG GCGATATTGG 5820
AAAAATCGAT ATTTGAAAAT ATGGCATATT GAAAATGTCG CCGATGTGAG TTTCTGTGTA 5880
ACTGATATCG CCATTTTTCC AAAAGTGATT TTTGGGCATA CGCGATATCT GGCGATAGCG 5940
CTTATATCGT TTACGGGGGA TGGCGATAGA CGACTTTGGT GACTTGGGCG [illegible] 6000
TCGCAAATAT [illegible] ATATAGGTGA CAGACGATAT GAGGCTATAT CGCCGATAGA 6060
GGCGACATCA AGCTGGCACA TGGCCAATGC ATATCGATCT ATACATTGAA TCAATATTGG 6120
CCATTAGCCA [illegible] TGGTTATATA GCATAAATCA ATATTGGCTA [illegible] 6180
CATACGTTGT ATCCATATCA TAATATGTAC ATTTATATTG GCTCATGTCC AACATTACCG 6240
CGATGTTGAC ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT 6300
CATAGCCCAT ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA 6360
CCGCCCAACG ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA 6420
ATAGGGACTT TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC CCACTTGGCA 6480
GTACATCAAG TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG 6540
CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC 6600
TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT 6660
GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT 6720
TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC CGCCCCATTG 6780
ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGGAGAGC TCGTTTAGTG 6840
AACCGTCAGA TCGCCTGGAG ACGCCATCCA CGCTGTTTTG ACCTCCATAG AAGACACCGG 6900
GACCGATCCA GCCTCCGCGG CCGGGAACGG TGCATTGGAA CGCGGATTCC CCGTGCCAAG 6960


AGTGACGTAA GTACCGCCTA TAGAGTCTAT AGGCCCACCC CCTTGGCTTC TTATGCATGC 7020
TATACTGTTT TTGGCTTCGG GTCTATACAC CCCCGCTTCC TCATGTTATA GGTGATGGTA 7080
TAGCTTAGCC TATAGGTGTG GGTTATTGAC CATTATTGAC CACTCCCCTA TTGGTGACGA 7140
TACTTTCCAT TACTAATCCA TAACATGGCT CTTTGCCACA ACTCTCTTTA TTGGCTATAT 7200
GCCAATACAC TGTCCTTCAG AGACTGACAC GGACTCTGTA TTTTTACAGG ATGGGGTCTC 7260
ATTTATTATT TACAAATTCA CATATACAAC ACCACCGTCC CCAGTGCCCG CAGTTTTTAT 7320
TAAACATAAC GTGGGATCTC CACGCGAATC TCGGGTACGT GTTCCGGACA TGGGCTCTTC 7380
TCCGGTAGCG GCGGAGCTTC TACATCCGAG CCCTGCTCCC ATGCCTCCAG CGACTCATGG 7440
TCGCTCGGCA TCTCCTTGCT CCTAACAGTG GAGGCCAGAC TTAGGCACAG CACGATGCCC 7500
ACCACCACCA GTGTGCCGCA CAAGGCCGTG GCGGTAGGGT ATGTGTCTGA AAATGAGCTC 7560
GGGGAGCGGG CTTGCACCGC TGACGCATTT GGAAGACTTA AGGCAGCGGC AGAAGAAGAT 7620
GCAGGCAGCT GAGTTGTTGT GTTCTGATAA GAGTCAGAGG TAACTCCCGT TGCGGTGCTG 7680
TTAACGGTGG AGGGCAGTGT AGTCTGAGCA GTACTCGTTG CTGCCGCGCG CGCCACCAGA 7740
CATAATAGCT GACAGACTAA CAGACTGTTC CTTTCCATGG GTCTTTTCTG CAGTCACCGT 7800
CCTTGACACG AAGCTTGGGC TGCAGGTCGA TCGACTCTAG AGGATCGATC CCCGGGCGAG 7860
CTCG 7864
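The listing above is laid out in blocks of ten bases, six blocks to a row, with each row followed by the position of its last base; the final partial row (CTCG) ends at 7864. A minimal Python sketch of that layout convention follows. The function name `format_listing` and the demo string are illustrative assumptions and are not part of the original disclosure.

```python
def format_listing(seq: str, block: int = 10, blocks_per_row: int = 6) -> str:
    """Lay out a nucleotide string in fixed-size blocks, one row per line,
    each row followed by the 1-based position of its last base."""
    row_len = block * blocks_per_row
    rows = []
    for start in range(0, len(seq), row_len):
        chunk = seq[start:start + row_len]
        # Split the row into blocks of `block` bases.
        blocks = [chunk[i:i + block] for i in range(0, len(chunk), block)]
        rows.append(f"{' '.join(blocks)} {start + len(chunk)}")
    return "\n".join(rows)


# Placeholder input (64 bases), NOT taken from the vector sequence.
demo = "ACGT" * 16
print(format_listing(demo))
```

With a 64-base input the sketch emits one full row ending at 60 and a short final row ending at 64, in the same way the 7864-base listing closes with a four-base row.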






WO 96/40921

PCT/US96/09287



WHAT IS CLAIMED IS:

1. A CDR-grafted antibody capable of
inhibiting human tissue factor wherein the
complementarity determining regions (CDRs) are derived
from a non-human monoclonal antibody against tissue
factor and the framework (FR) and constant (C) regions
are derived from one or more human antibodies.

2. The CDR-grafted antibody of Claim 1
wherein said non-human monoclonal antibody is a murine
antibody.

3. The CDR-grafted antibody of Claim 2
wherein said murine antibody is TF8-5G9.

4. The CDR-grafted antibody of Claim 1
wherein said CDRs of the heavy chain have the amino acid
sequences:

CDR1 DDYMH (SEQ ID NO: 5)
CDR2 LIDPENGNTIYDPKFQG (SEQ ID NO: 6)
CDR3 DNSYYFDY (SEQ ID NO: 7)

and said CDRs of the light chain have the amino acid
sequences:

CDR1 KASQDIRKYLN (SEQ ID NO: 8)
CDR2 YATSLAD (SEQ ID NO: 9)
CDR3 LQHGESPYT (SEQ ID NO: 10).

5. The CDR-grafted antibody of Claim 1
wherein the FR of the heavy chain is derived from the
human antibody KOL.

6. The CDR-grafted antibody of Claim 1
wherein the FR of the light chain is derived from the
human antibody REI.

7. The CDR-grafted antibody of Claim 1
wherein the heavy chain variable region has the amino
acid sequence of SEQ ID NO: 11.

8. The CDR-grafted antibody of Claim 1 or 7
wherein the light chain variable region has the amino
acid sequence of SEQ ID NO: 12.

9. The CDR-grafted antibody of Claim 1
wherein the heavy chain variable region has the amino
acid sequence of SEQ ID NO: 13.

10. The CDR-grafted antibody of Claim 1 or 9
wherein the light chain variable region has the amino
acid sequence of SEQ ID NO: 14.

11. The CDR-grafted antibody of Claim 1
wherein the heavy chain constant region is the human
IgG4 constant region.

12. The CDR-grafted antibody of Claim 10
wherein the heavy chain constant region is the human
IgG4 constant region.

13. The CDR-grafted antibody of Claim 1
wherein the light chain constant region is the human
kappa constant region.

14. The CDR-grafted antibody of Claim 10
wherein the light chain constant region is the human
kappa constant region.

15. CDR-grafted monoclonal antibody TF8HCDR1
x TF8LCDR1.

16. CDR-grafted monoclonal antibody TF8HCDR20
x TF8LCDR3.

17. A fragment of the CDR-grafted antibody of
Claim 1 wherein said fragment is capable of inhibiting
human tissue factor.

18. The fragment of Claim 17 wherein said
fragment is an Fab or F(ab')2 fragment.

19. A method of making the CDR-grafted
antibody of Claim 1 comprising cotransfecting a host
cell with an expression vector comprising a nucleic acid
encoding the CDR-grafted antibody heavy chain and an
expression vector comprising a nucleic acid encoding the
CDR-grafted antibody light chain; culturing the
transfected host cell; and recovering said CDR-grafted
antibody.

20. A method of making the CDR-grafted
antibody of Claim 1 comprising transfecting a host cell
with an expression vector comprising a nucleic acid
encoding the CDR-grafted antibody heavy chain and a
nucleic acid encoding the CDR-grafted antibody light
chain; culturing the transfected host cell; and
recovering said CDR-grafted antibody.

21. The method of Claim 18 or 19 wherein said
nucleic acid encoding the CDR-grafted antibody heavy
chain has the sequence of nucleotides 1-2360 of SEQ ID
NO: 15.

22. The method of Claim 18 or 19 wherein said
nucleic acid encoding the CDR-grafted light chain has
the sequence of nucleotides 1-759 of SEQ ID NO: 17.

23. The method of Claim 19 or 20 wherein said
host cell is a bacterial cell, yeast cell, insect cell
or mammalian cell.

24. The method of Claim 23 wherein said
mammalian cell is a CHO cell, COS cell or myeloma cell.

25. The method of Claim 19 wherein said
expression vector comprising a nucleic acid encoding the
CDR-grafted antibody heavy chain is pEe6TF8HCDR20.

26. The method of Claim 19 wherein said
expression vector comprising a nucleic acid encoding the
CDR-grafted antibody light chain is pEe12TF8LCDR3.

27. A nucleic acid encoding the heavy chain
of the CDR-grafted antibody of Claim 1.

28. A nucleic acid encoding the light chain
of the CDR-grafted antibody of Claim 1.

29. The nucleic acid of Claim 27 having the
sequence of nucleotides 1-2360 of SEQ ID NO: 15.

30. The nucleic acid of Claim 28 having the
sequence of nucleotides 1-759 of SEQ ID NO: 17.

31. A method of attenuation of coagulation
comprising administering a therapeutically effective
amount of a CDR-grafted antibody capable of inhibiting
human tissue factor to a patient in need of said
attenuation.

32. The method of Claim 31 wherein said CDR-
grafted antibody is TF8HCDR20 x TF8LCDR3.

33. A method of treatment or prevention of
thrombotic disorder comprising administering a
therapeutically effective amount of a CDR-grafted
antibody capable of inhibiting human tissue factor to a
patient in need of said treatment or prevention.

34. The method of Claim 33 wherein said
thrombotic disorder is intravascular coagulation,
arterial restenosis or arteriosclerosis.

35. The method of Claim 33 or 34 wherein said
CDR-grafted antibody is TF8HCDR20 x TF8LCDR3.

36. A pharmaceutical composition comprising
at least one CDR-grafted antibody capable of inhibiting
human tissue factor and a pharmaceutically acceptable
carrier.






37. The pharmaceutical composition of Claim
36 wherein said CDR-grafted antibody is TF8HCDR20 x
TF8LCDR3.

Sequence of the murine TF8-5G9 heavy chain cDNA with protein
translation. The essential regions of the cDNA are as follows:

Nucleotides    Region
1-10           5' untranslated region.
11-67          Start codon and leader sequence.
68-418         Variable region.
419-1390       Murine IgG1 constant region.
1391-1489      3' untranslated region.

Sequence Range: 1 to 1489
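The region coordinates listed for the heavy chain cDNA can be checked for internal consistency. The sketch below, in Python, encodes the five ranges as stated, confirms that they tile positions 1 to 1489 without gaps or overlaps, and reports each region's length. The names `HEAVY_CHAIN_REGIONS`, `is_contiguous`, `region_lengths` and `extract` are illustrative assumptions, not terms from the original disclosure.

```python
# 1-based, inclusive coordinates as given for the heavy chain cDNA.
HEAVY_CHAIN_REGIONS = [
    (1, 10, "5' untranslated region"),
    (11, 67, "start codon and leader sequence"),
    (68, 418, "variable region"),
    (419, 1390, "murine IgG1 constant region"),
    (1391, 1489, "3' untranslated region"),
]


def is_contiguous(regions, total_length):
    """Return True if the regions cover 1..total_length with no gap or overlap."""
    expected_start = 1
    for start, end, _name in regions:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start - 1 == total_length


def region_lengths(regions):
    """Map each region name to its length in nucleotides."""
    return {name: end - start + 1 for start, end, name in regions}


def extract(seq, start, end):
    """Slice a sequence string using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


lengths = region_lengths(HEAVY_CHAIN_REGIONS)
print(is_contiguous(HEAVY_CHAIN_REGIONS, 1489))
print(lengths["variable region"], lengths["variable region"] // 3)
```

Under these coordinates the variable region spans 351 nucleotides, a whole number of codons (117), and the leader region spans 57 nucleotides (19 codons including the start codon).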



        10          20          30          40
CGT CCT TAC A ATC AAA TCC ACC TGC GTC ATC TTC TTC CTG ATC CCA CTG
CCA CCA ATC T TAC TTT ACC TCC ACC CAG TAG AAC AAC GAC TAC CCT CAC
              Met Lys Cys Ser Trp Val Ile Phe Phe Leu Met Ala Val>

50          60          70          80          90
CTT ACA CCG CTC AAT TCA CAC ATT CAC CTC CAC CAG TCT GCC CCT CAG
CAA TCT CCC CAG TTA ACT CTC TAA GTC GAC GTC GTC AGA CCC CCA CTC
Val Thr Gly Val Asn Ser Glu Ile Gln Leu Gln Gln Ser Gly Ala Glu>

100         110         120         130         140
CTT GTC ACC CCA GGG CCC TTA CTC AAC TTC TCC TCC AAA CCT TCT CCC
GAA CAC TCC CGT CCC CCG AAT CAG TTC AAC ACG ACG TTT CCA AGA CCC
Leu Val Arg Pro Gly Ala Leu Val Lys Leu Ser Cys Lys Ala Ser Gly>

150         160         170         180         190
TTC AAC ATT AAA GAC TAC TAT ATC CAC TGC GTC AAC CAC ACG CCT CAA
AAC TTC TAA TTT CTG ATC ATA TAC GTC ACC CAC TTC CTC TCC CCA CTT
Phe Asn Ile Lys Asp Tyr Tyr Met His Trp Val Lys Gln Arg Pro Glu>

200         210         220         230         240
CAC GCC CTC CAG TCC ATT CCA TTC ATT GAT CCT GAG AAT CGT AAT ACT
CTC CCG CAC CTC ACC TAA CCT AAC TAA CTA GCA CTC TTA CCA TTA TCA
Gln Gly Leu Glu Trp Ile Gly Leu Ile Asp Pro Glu Asn Gly Asn Thr>

250         260         270         280
ATA TAT CAC CCG AAC TTC CAC GCC AAC CCC ACT ATA ACA CCA CAC ACA
TAT ATA CTC CCC TTC AAC CTC CCC TTC CCC TCA TAT TCT CCT CTC TCT
Ile Tyr Asp Pro Lys Phe Gln Gly Lys Ala Ser Ile Thr Ala Asp Thr>

290         300         310         320         330
TCC TCC AAC ACA GCC TAC CTG CAG CTC ACC ACC CTG ACA TCT GAG GAC
ACC ACC TTC TCT CCG ATC GAC GTC CAG TCC TCC GAC TCT ACA CTC CTG
Ser Ser Asn Thr Ala Tyr Leu Gln Leu Ser Ser Leu Thr Ser Glu Asp>

FIG. 1B

350         360         370         380
ACT GCC GTC TAT TAC TGT CCT ACA GAT AAC TCC TAC TAC TTT CAC TAC
TGA CCG CAG ATA ATG ACA CGA TCT CTA TTG AGC ATG ATG AAA CTG ATG
Thr Ala Val Tyr Tyr Cys Ala Arg Asp Asn Ser Tyr Tyr Phe Asp Tyr>

390         400         410         420         430
TGG GCC CAA GGC ACC ACT CTC ACA CTC TCC TCA GCC AAA ACG ACA CCC
ACC CCG CTT CCG TGG TGA GAG TGT CAG AGC ACT CCC TTT TGC TCT GGG
Trp Gly Gln Gly Thr Thr Leu Thr Val Ser Ser Ala Lys Thr Thr Pro>

440         450         460         470         480
CCA TCT GTC TAT CCA CTG GCC CCT GGA TCT GCT GCC CAA ACT AAC TCC
GGT ACA CAG ATA GGT GAC CGG GGA CCT ACA CGA CCG GTT TGA TTG AGG
Pro Ser Val Tyr Pro Leu Ala Pro Gly Ser Ala Ala Gln Thr Asn Ser>

490         500         510         520
ATG GTG ACC CTG GGA TCC CTG CTC AAG GGC TAT TTC CCT CAG CCA CTC
TAC CAC TGG CAC CCT ACG GAC CAG TTC CCG ATA AAG CGA CTC GCT CAC
Met Val Thr Leu Gly Cys Leu Val Lys Gly Tyr Phe Pro Glu Pro Val>

530         540         550         560         570
ACA GTG ACC TGG AAC TCT GGA TCC CTG TCC AGC GGT GTG CAC ACC TTC
TCT CAC TGG ACC TTC ACA CCT AGG GAC AGG TCC CCA CAC GTG TGG AAG
Thr Val Thr Trp Asn Ser Gly Ser Leu Ser Ser Gly Val His Thr Phe>

580         590         600         610         620
CCA CCT GTC CTG CAC TCT CAC CTC TAC ACT CTC AGC AGC TCA CTG ACT
GGT CCA CAG GAC GTC ACA CTG GAC ATC TGA GAC TCC TCC ACT CAC TGA
Pro Ala Val Leu Gln Ser Asp Leu Tyr Thr Leu Ser Ser Ser Val Thr>

630         640         650         660         670
GTC CCC TCC ACC ACC TCC CCC ACC GAG ACC GTC ACC TCC AAC CTT GCC
CAC GGC AGC TCC TGG ACC GGC TCC CTC TGG CAG TGG ACC TTG CAA CGG
Val Pro Ser Ser Thr Trp Pro Ser Glu Thr Val Thr Cys Asn Val Ala>

680         690         700         710         720
CAC CCC CCC ACC ACC ACC AAC GTC GAC AAG AAA ATT CTC CCC AGC GAT
GTC GGC CCC TCC TCC TGG TTC CAC CTG TTC TTT TAA CAC CGG TCC CTA
His Pro Ala Ser Ser Thr Lys Val Asp Lys Lys Ile Val Pro Arg Asp>

730         740         750         760
TCT GGT TGT AAG CCT TCC ATA TCT ACA CTC CCA CAA CTA TCA TCT GTC
ACA CCA ACA TTC CCA ACC TAT ACA TGT CAC GGT CTT CAT ACT AGA CAG
Cys Gly Cys Lys Pro Cys Ile Cys Thr Val Pro Glu Val Ser Ser Val>

770         780         790         800         810
TTC ATC TTC CCC CCA AAG CCC AAC CAT CTG CTC ACC ATT ACT CTC ACT
AAC TAG AAG GGG GGT TTC CGG TTC CTA CAC CAG TCC TAA TGA CAC TGA
Phe Ile Phe Pro Pro Lys Pro Lys Asp Val Leu Thr Ile Thr Leu Thr>

FIG. 1C

820         830         840         850         860
CCT AAG GTC ACC TCT GTT CTC CTA CAC ATC AGC AAC GAT GAT CCC GAG
GGA TTC CAG TGC ACA CAA CAC CAT CTC TAG TCG TTC CTA CTA GGG CTC
Pro Lys Val Thr Cys Val Val Val Asp Ile Ser Lys Asp Asp Pro Glu>

870         880         890         900         910
[...] CAC ACA CCT CAC
[...] CTG TGT CGA GTC
[...] His Thr Ala Gln>

920         930         940         950         960
ACC CAA CCC CCC GAG GAG CAG CTC [...]
TCC CTT GGG CCC CTC CTC GTC AAG [...]
Thr Gln Pro Arg Glu Glu Gln Phe [...]

970         980         990         1000
CAA CTT CCC ATC ATC CAC CAG CAC [...]
CTT CAA GGG TAG TAC GTC CTC CTG [...]
Glu Leu Pro Ile Met His Gln Asp [...]

1010        1020        1030        1040        1050
TCC AGG CTC AAC ACT CCA CCT TTC [...] CCC ATC CAC AAA ACC ATC
ACC TCC CAC TTG TCA CCT CGA AAC [...] GGG TAG CTC TTT TCC TAG
Cys Arg Val Asn Ser Ala Ala Phe [...] Pro Ile Glu Lys Thr Ile>

1060        1070        1080        1090        1100
TCC AAA ACC AAA GGC ACA CCC AAG [...] CAG GTC TAC ACC ATT CCA
ACC TTT TCC TTT CCC TCT GGC TTC [...] CTC CAC ATC TCC TAA CCT
Ser Lys Thr Lys Gly Arg Pro Lys [...] Gln Val Tyr Thr Ile Pro>

1110        1120        1130        1140        1150
CCT CCC AAC CAG CAC ATC GCC AAC [...] CTC ACT CTC ACC TCC ATC
GGA GGG TTC CTC GTC TAC CCC TTC [...] CAG TCA CAC TCC ACC TAC
Pro Pro Lys Glu Gln Met Ala Lys [...] Val Ser Leu Thr Cys Met>

1160        1170        1180        1190        1200
ATA ACA CAC TTC TTC CCT CAA CAC [...]
TAT TCT CTC AAG AAG GGA CTT CTC [...]
Ile Thr Asp Phe Phe Pro Glu Asp [...]

1210        1220        1230        1240
[...] CCC CAC AAC TAC AAG AAC ACT CAC CCC ATC [...]
[...] CCC CTC TTC ATC TTC TTC TCA CTC GGC TAG [...]
[...] Ala Glu Asn Tyr Lys Asn Thr Gln Pro Ile [...]

1250        1260        1270        1280        1290
[nucleotide lines illegible]
Asp Gly Ser Tyr Phe Val Tyr Ser Lys Leu Asn Val Gln Lys Ser Asn>

FIG. 1D

1300        1310        1320        1330        1340
TCC GAC CCA CCA AAT ACT TTC ACC TCC TCT GTC TTA CAT GAG CCC CTC
ACC CTC CGT CCT TTA TGA AAG TGG ACC ACA CAC AAT CTA CTC CCC GAC
Trp Glu Ala Gly Asn Thr Phe Thr Cys Ser Val Leu His Glu Gly Leu>

1350        1360        1370        1380        1390
CAC AAC CAC CAT ACT CAC AAC ACC CTC TCC CAC TCT CCT CCT AAA TC ATC
GTC TTC CTC CTA TGA CTC TTC TCC GAC ACC CTC AGA GGA CCA TTT AC TAG
His Asn His His Thr Glu Lys Ser Leu Ser His Ser Pro Gly Lys>

1400        1410        1420        1430        1440
CCA CTC TCC TTC GAG CCC TCT CCT CCT ACA GGA CTC TGA CAC CTA CCT
GCT CAC ACG AAC CTC GGG AGA CCA GGA TCT CCT GAC ACT CTC GAT GGA

1450        1460        1470        1480
CCA CCC CTC CCT CTA TAA ATA AAG CAC CCA CCA CTC CCT TCC ACC C
CGT GGG GAG GGA CAT ATT TAT TTC GTC GGT CGT GAC GGA ACC TCC G

Sequence of the murine TF8-5G9 light chain cDNA with protein
translation. The essential regions of the cDNA are as follows:

FIG. 2A

Nucleotides    Region
1-4            5' untranslated.
5-64           Start codon and leader sequence.
65-385         Variable region.
386-706        Murine kappa constant region.
707-917        3' untranslated region.
918-937        Poly A tail.

Sequence Range: 1 to 937
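The sequence figures print three aligned lines per row: the coding strand, the complementary strand written in the same left-to-right order, and the three-letter translation of each codon. The Python sketch below shows how the second and third lines follow from the first, using the standard genetic code and one-letter amino acid codes for brevity. The names `CODON_TABLE`, `complement` and `translate` are illustrative assumptions, and the nine-base input is a made-up example chosen to encode Met-Arg-Ala; it is not read from the figure.

```python
from itertools import product

# Standard genetic code in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(seq: str) -> str:
    """Base-by-base complement, kept in the same left-to-right order."""
    return seq.translate(_COMPLEMENT)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; trailing bases are ignored."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


coding = "ATGAGGGCC"  # hypothetical example, not taken from the figure
print(coding)
print(complement(coding))
print(translate(coding))
```

Because the figure rows begin at arbitrary positions, a reading frame taken from the start codon has to be carried from one row to the next; the sketch translates only from the first base of the string it is given.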



     10          20          30          40
CCA C ATG CGG CCC CCT GCT CAC TTT TTT CCC ATC TTC TTC CTC TCC TTT
CCT G TAC CCC CCC GGA CCA GTC AAA AAA CCC TAG AAC AAC GAG ACC AAA
      Met Arg Ala Pro Ala Gln Phe Phe Gly Ile Leu Leu Leu Trp Phe>

50          60          70          80          90
CCA GCT ATC AGA TCT GAC ATC AAC ATG ACC CAC TCT [...]
GGT CCA TAG TCT ACA CTC TAG TTC TAC TCG CTC AGA [...]
Pro Gly Ile Arg Cys Asp Ile Lys Met Thr Gln Ser [...]

100         110         120         130
TAT CCA TCG CTC GGA GAC AGA CTC ACT ATC ACT TCT [...] ACT CAC
ATA CCT ACC GAC CCT CTC TCT CAC TCA TAC TCA ACA [...] TCA GTC
Tyr Ala Ser Leu Gly Glu Arg Val Thr Ile Thr Cys [...] Ser Gln>

140         150         160         170         180         190
GAC ATT AGA AAC TAT TTA AAC TCG TAC CAC CAC AAA [...] AAA TCT
CTG TAA TCT TTC ATA AAT TTG ACC ATC CTC CTC TTT [...] TTT AGA
Asp Ile Arg Lys Tyr Leu Asn Trp Tyr Gln Gln Lys [...] Lys Ser>

200         210         220         230         240
CCT AAG ACC CTG ATC TAT TAT CCA ACA ACC TTG CCA [...] CTC CCA
GGA TTC TCC GAC TAG ATA ATA CCT TGT TCC AAC CCT [...] CAC GGT
Pro Lys Thr Leu Ile Tyr Tyr Ala Thr Ser Leu Ala [...] Val Pro>

250         260         270         280
TCA ACA TTC ACT CCC ACT CCA TCT [...] ACC ATC
ACT TCT AAC TCA CCC TCA CCT ACA [...] TGG TAC
Ser Arg Phe Ser Gly Ser Gly Ser [...] Thr Ile>

300         310         320         330
ACC ACC CTC GAG TCT GAC CAT ACA [...] CAA CAT
TCC TCC GAC CTC ACA CTC CTA TCT [...] GTT CTA
Ser Ser Leu Glu Ser Asp Asp Thr [...]

FIG. 2B



340         350         360         370         380
GGT GAG AGC CCC TAC ACC TTC GGA GGG GGG ACC AAC CTC CAA ATA AAC
CCA CTC TCG GGC ATG TCC AAG CCT CCC CCC TCC TTC CAC CTT TAT TTG
Gly Glu Ser Pro Tyr Thr Phe Gly Gly Gly Thr Lys Leu Glu Ile Asn>

390         400         410         420         430
AGG GCT GAT CCT CCA CCA ACT GTA TCC ATC TTC CCA CCA TCC ACT GAG
TCC CCA CTA CCA CCT GGT TCA CAT AGG TAG AAG GGT GCT AGG TCA CTC
Arg Ala Asp Ala Ala Pro Thr Val Ser Ile Phe Pro Pro Ser Ser Glu>

440         450         460         470         480
CAG TTA ACA TCT CCA GCT CCC TCA GTC GTC TCC TTC TTC AAC AAC TTC
CTC AAT TGT AGA CCT CCA CGG ACT CAG CAC ACC AAC AAC TTG TTG AAG
Gln Leu Thr Ser Gly Gly Ala Ser Val Val Cys Phe Leu Asn Asn Phe>

490         500         510         520
TAC CCC AAA CAC ATC AAT CTC AAG TCG AAG ATT GAT GGC ACT CAA CCA
ATG GGG TTT CTC TAC TTA CAG TTC ACC TTC TAA CTA CCC TCA CTT CCT
Tyr Pro Lys Asp Ile Asn Val Lys Trp Lys Ile Asp Gly Ser Glu Arg>

530         540         550         560         570
CAA AAT CCC CTC CTC AAC ACT TCG ACT GAT CAC GAC AGC AAA GAC ACC
GTT TTA CCC CAG CAC TTG TCA ACC TCA CTA GTC CTC TCG TTT CTC TCC
Gln Asn Gly Val Leu Asn Ser Trp Thr Asp Gln Asp Ser Lys Asp Ser>

580         590         600         610         620
ACC TAC ACC ATG ACC ACC ACC CTC ACC TTG ACC AAG GAC GAC TAT CAA
TCG ATG TCG TAC TCC TCC TCG GAG TCC AAC TCG TTC CTC CTC ATA CTT
Thr Tyr Ser Met Ser Ser Thr Leu Thr Leu Thr Lys Asp Glu Tyr Glu>

630         640         650         660         670
CCA CAT AAC ACC TAT ACC TCT GAG CCC ACT CAC AAG ACA TCA ACT TCA
GCT CTA TTC TCC ATA TCG ACA CTC CGG TCA CTC TTC TCT ACT TCA ACT
Arg His Asn Ser Tyr Thr Cys Glu Ala Thr His Lys Thr Ser Thr Ser>

680         690         700         710         720
CCC ATT CTC AAC ACC TTC AAC ACC AAT GAG TCT TA GAG ACA AAG CTC CTC
GGG TAA CAG TTC TCC AAC TTC TCC TTA CTC ACA AT CTC TCT TTC CAC CAC
Pro Ile Val Lys Ser Phe Asn Arg Asn Glu Cys>

730         740         750         760         770
ACA CCC CAC CAC CAC CTC CCC AGC TCC ATC CTA TCT TCC CTT CTA AGG
TCT CCC CTC CTC CTC GAG CGG TCG ACC TAC GAT AGA AGG CAA GAT TCC

780         790         800         810
TCT TCG ACC CTT CCC CAC AAC CCA CCT ACC ACT CTT GCC CTC CTC CAA
ACA ACC TCC CAA CCC CTC TTC CCT GGA TCG TCA CAA CCC CAC CAC CTT

FIG. 2C



820         830         840         850         860
ACCTCCTCCCCXOTOTTCTCCTOTOTCCCTTO CCT TCC CTT TTA
TCCACCMGCGTCGACGAAGXCaACCA GGA GGG AAA GGA ACC GAA AAT

870         880         890         900         910
TCA TCC TAA TAT TTG CAC AAA ATA TTC AAT AAA GTC ACT CTT TCC ACT
ACT ACC ATT ATA AAC CTC TTT TAT AAG TTA TTT CAC TCA CAA ACC TCA

920         930
TCA AAA AAA AAA AAA AAA AAA A
ACT TTT TTT TTT TTT TTT TTT T

FIG. 4A

The pEe6TF8HCDR20 expression vector DNA sequence. The
coding regions of TF8-5G9 CDR-grafted HC gene, TF8HCDR20,
are translated.

Sequence Range: 1 to 7073

10          20          30          40
GAA TTC GCC GCC ACC ATG GAA TGG ACC TGG GTC TTT CTC TTC TTC TTG
CTT AAG CGG CGG TGG TAC CTT ACC TCC ACC CAC AAA GAG AAG AAG AAC
                    Met Glu Trp Ser Trp Val Phe Leu Phe Phe Leu>

50          60          70          80          90
TCA CTA ACT ACA GCT GTA CAC TCA CAA GTT CAC CTG GTC GAG TCT GGA
ACT CAT TGA TCT CCA CAT CTC ACT GTT CAA CTC GAC CAC CTC AGA CCT
Ser Val Thr Thr Gly Val His Ser Gln Val Gln Leu Val Glu Ser Gly>

100         110         120         130         140
GGA GGA CTA CTA CAA CCT GGA AGG TCA CTC AGA CTG TCT TGT AAG GCT
CCT CCT CAT CAT GTT GGA CCT TCC ACT GAC TCT GAC AGA ACA TTC CCA
Gly Gly Val Val Gln Pro Gly Arg Ser Leu Arg Leu Ser Cys Lys Ala>

150         160         170         180         190
ACT GGA TTC AAT ATC AAG CAC TAT TAT ATC CAC TGG CTC ACA CAA GCT
TCA CCT AAC TTA TAG TTC CTC ATA ATA TAC GTC ACC CAC TCT GTT CCA
Ser Gly Phe Asn Ile Lys Asp Tyr Tyr Met His Trp Val Arg Gln Ala>

200         210         220         230         240
CCT GGA AAA GGA CTC GAG TGG ATA GGT TTA ATT GAT CCT GAC AAT GGT
GGA CCT TTT CCT GAG CTC ACC TAT CCA AAT TAA CTA GGA CTC TTA CCA
Pro Gly Lys Gly Leu Glu Trp Ile Gly Leu Ile Asp Pro Glu Asn Gly>

250         260         270         280
AAC ACC ATA TAT GAT CCC AAG TTC CAA GGA AGA TTC ACA ATT TCT CCA
TTG TCC TAT ATA CTA CGG TTC AAG CTT CCT TCT AAC TCT TAA ACA CCT
Asn Thr Ile Tyr Asp Pro Lys Phe Gln Gly Arg Phe Thr Ile Ser Ala>

290         300         310         320         330
CAC AAC TCT AAC AAT ACA CTC TTC CTG CAC ATC GAC TCA CTC ACA CCT
CTC TTG AGA TTC TTA TGT GAC AAC GAC GTC TAC CTG ACT GAG TCT GGA
Asp Asn Ser Lys Asn Thr Leu Phe Leu Gln Met Asp Ser Leu Arg Pro>

340         350         360         370         380
GAG GAT ACA CCA GTC TAC TAT TGT GCT AGA GAT AAC ACT TAT TAC TTC
CTC CTA TCT CCT CAC ATC ATA ACA CCA TCT CTA TTC TCA ATA ATG AAC
Glu Asp Thr Ala Val Tyr Tyr Cys Ala Arg Asp Asn Ser Tyr Tyr Phe>

390         400         410         420         430
GAC TAC TGG GCC CAA GGA ACA CCA CTC ACC CTC AGC TCA GCT TCC ACC
CTC ATG ACC CCC CTT CCT TGT GGT CAC TGG CAC TCC ACT CCA AGG TGG
Asp Tyr Trp Gly Gln Gly Thr Pro Val Thr Val Ser Ser Ala Ser Thr>

FIG. 4B



440 450 460 470 480 

* 

AAG GCC CCA TCC CTC TTC CCC CTG CCG CCC TGC TCC AGG AGC ACC TCC 
TTC CCG GGT ACC CAC AAC CCG GAC CCC GGG ACC ACG TCC TCC TCG ACC 
LyB Gly Pro Ser Val Phe Pro Leu Ala Pro Cyo Ser Arg Ser Thr Ser> 



490 500 510 520 

GAC ACC ACA GCC CCC CTG CCC TGC CTG GTC AAG GAC TAG TTC CCC GAA 
CTC TCG TGT CGG CCG CAC CCG ACG GAC CAC TTC CTG ATG AAG GGG CTT 
Clu Ser Thr Ala Ala Leu Gly Cyo Lou Val Lya Asp Tyr Phe Pro Glu> 

530 540 550 560 570 

CCG GTG ACG CTC TCG TGC AAC TCA CGC GCC CTG ACC ACC CCC CTG CAC 
GCC CAC TGC CAC ACC ACC TTC ACT CCC CCG GAC TCG TCC CCC CAC CTC 
Pro Val Thr Val Ser Trp Abu Ser Gly Ala Leu Thr Ser Gly Val Hia> 

5B0 590 600 610 620 

* * * * * 

ACC TTC CCG CCT GTC CTA CAC TCC TCA CCA CTC TAC TCC CTC ACC ACC 
TGG AAG CGC CCA CAG CAT GTC ACC ACT CCT GAG ATG ACG GAC TCC TCG 
Thr Phe Pro Ala Val Leu Gin Sex Ser Gly Leu Tyr Ser Leu Scr Ser> 

630 640 650 660 670 

* • • * * 

GTG GTG ACC GTG CCC TCC AGC ACC TTG GGC ACC AAG ACC TAC ACC TCC 
CAC CAC TGC CAC CGG ACC TCC TCC AAC CCG TCC TTC TCG ATG TGG ACC 
Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Lys Thr Tyr Thr Cys>

680 690 700 710 720 

• • * * * 

AAC CTA GAT CAC AAC CCC ACC AAC ACC AAC GTC GAC AAC ACA GTT GGT 
TTC CAT CTA CTC TTC CGG TCC TTG TGG TTC CAC CTC TTC TCT CAA CCA 
Asn Val Asp His Lys Pro Ser Asn Thr Lys Val Asp Lys Arg Val>

730 740 750 760 

* ♦ * 

GAC ACG CCA CCA CAC CCC ACG GAG .GGT GTC TCC TGG AAC CCA CGC TCA 
CTC TCC CCT CCT CTC CCC TCC CTC CCA CAC ACC ACC TTC CCT CCC ACT 

770 780 790 800 810 

GCC CTC CTC CCT CCA CCC ACC CCG GCT CTC CAC CCC CAC CCC ACC CCA 
CCC GAC GAC CCA CCT CCC TCG GGC CCA CAC GTC CCC CTC CCC TCC CCT 

820 830 840 850 860 

• * * ♦ * 

CCA ACC CAT CCC CCA TCT CTC TCC TCA CCC CCA CGC CTC TCA CCA CCC 
CCT TCC CTA CCC GCT ACA CAC ACG ACT CCC CCT CCC GAG ACT GCT CGG 

870 880 890 900 910

• • * * * 

CAC TCA TCC TCA CCC ACA CCC TCT TCT CCA TTT TTC CAC CAC GCT CCC 

GTG ACT ACC ACT CCC TCT CCC ACA ACA CCT AAA AAC CTC CTC CCA GCC 
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FIG. 4 C 





920 
• 






930 
* 






940 
* 




950 
• 




960 
* 


ggc 
ccc 


AGC 
TCC 


CAC 
CTC 


ACC 
TCC 


CTC 
GAC 


GAT 
CTA 


CCC 
CCC 


CCT 
GGA 


ACC 
TCG 


CCA 
GCT 


CCC 
CCC 


CCT 
GGA 


CCC CAT ACA 
CCC GTA TCT 


GGG 
CCC 






970 
* 




980






990 
* 






1000 
• 




CCA 
CGT 


CGT 
CCA 


GCT 
CCA 


CCG 
CCC 


CTC 
GAG 


AGA 
TCT 


CCT 
GGA 


CCC 
CCG 


AAG 

TTC 


ACC 
TCG 


CAT ATC 
GTA TAG 


CCC 
GCC 


GAG GAC 
CTC CTC 


CCT 
GGA 


1010
* 




1020 






1030 




1040 




1050 




CCC 
CGG 


CCT 
GGA 


GAC 
CTC 


CTA 
GAT 


ACC 
TCC 


CCA 
GCT 


CCC 
GGG 


CAA 

CTT 


AGG 
TCC 


-CCA 
GCT 


AAC 

TTC 


TCT 
AGA 


CCA 
GCT 


w 

CTC CCT 
GAC CCA 


CAG 
GTC 


1060 
* 




1070 




1080 
* 






1090 
* 




1100 
* 




CTC 
GAG 


AGA 

TCT 


CAC 
GTC 


CTT 
GAA 


CTC 
GAG 


TCC 
ACC 


TCC 
AGG 


CAC 
GTC 


ATT 
TAA 


CGA 
GCT 


CTA 
CAT 


ACT 
TCA 


CCC 
CCG 


AAT CTT 
TTA CAA 


CTC 
GAG 


1110 
* 






1120 




1130 
* 




1140 
* 




1150 


Tcr 

AGA 


CCA 
CGT 


GAG 
CTC 
Glu


TCC 
ACC 
Ser 


AAA 

TIT 

Lys 


TAT 

ATA 

Tyr 


CCT 
CCA 
Gly 


CCC 
GGG 
Pro 


CCA 
GCT 
Pro 


TCC 
ACC 
Cys


CCA 
Pro 


TCA 

ACT 
Ser 


TCC 
AOS 
Cys 


CCA CGT 
GCT CCA 
Pro> 


AAG 

TTC 




1160 




1170 






1180 
* 




1190 


1200 


CCA 
CGT 


ACC 
TCC 


CAC 
GTC 


CCC 

ccc 


TCG 
ACC 


CCC 
GGG 


TCC 
AGG 


ACC 
TCG 


TCA AGG 
ACT TCC 


CCC 
CCC 


GAC 

CTC 


AGG 

TCC 


TCC CCT 
ACC GGA 


AGA 

TCT 



1210 1220 1230 1240 

* * * * • 

GTA CCC TCC ATC CAG GGA CAG GCC CCA GCC GCC TCC TCA CCC ATC CAC 
CAT CCC ACC TAG CTC CCT CTC CCG CCT CCG CCC ACC ACT CCG TAG GTC 

1250 1260 1270 1280 1290 

• * • * * 

CTC CAT CTC TTC CTC ACC A CCT GAC TTC CTC GGG GGA CCA TCA GTC TTC 
GAG GTA GAC AAC GAG TCC T CGA CTC AAG CAC CCC CCT CGT ACT CAC AAC 

Pro Glu Phe Leu Gly Gly Pro Ser Val Phe>



1300 
• 






1310 
• 




1320 1330 
* * 




1340 
* 




CTC 
GAC 
Leu 


TTC 
AAC 
Phe 


CCC 
CCG 
Pro 


CCA AAA 
CGT TTT 
Pro Lys 


CCC 
GGG 
Pro 


AAG 

TTC 
Lys


GAC ACT CTC 
CTC TCA GAG 
Asp Thr Leu 


ATC ATC 
TAC TAG 
Met Ile


TCC 
AGG 
Ser 


CCC ACC 
CCC TCG 
Arg Thr


CCT 
CGA 
Pro> 


1350 
* 




1360 
* 






1370 
• 


1380 
* 




1390 
• 




CAG 
CTC 
Glu 


CTC 
CAC 
Val 


ACC 
TCC 
Thr 


TCC CTC 
ACC CAC 
Cys Val 


CTC 
CAC 
Val 


CTC 
CAC 
Val 


GAC CTC ACC 
CTC CAC TCC 
Asp Val Ser 


CAC GAA 
GTC CTT 
Gln Glu


GAC 
CTC 
Asp 


CCC GAG 
GCC CTC 
Pro Glu 


CTC 
CAC 
Val> 



1400 1410 1420 1430 1440 

* * ♦ • » 

CAC TTC AAC TCC TAC CTC GAT CCC CTC CAC CTC CAT AAT CCC AAG ACA 
CTC AAC TTC ACC ATC CAC CTA CCC CAC CTC CAC CTA TTA CCC TTC TCT 
Gln Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr>
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FIG. 4 D 



1450 1460 1470 1480

* * * * 

AAG CCG CCC GAG GAG CAG TTC AAC AGC ACQ TAC OCT GTG GTC AGC CTC 

TTC GGC CCC CTC CTC CTC AAC TTC TCC TCC ATG CCA CAC CAG TCC CAG 

Lys Pro Arg Glu Glu Gln Phe Asn Ser Thr Tyr Arg Val Val Ser Val>

1490 1500 1510 1520 1530 

* * * * * 

CTC ACC GTC CTG CAC CAG GAC TGG CTC AAC GGC AAG GAG TAG AAC TGC 
GAG TGG CAG GAC GTG GTC CTG ACC GAC TTG CCC TTC CTC ATG TTC ACC 
Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys>



1540 1550 1560 

* • # 

AAC CTC TCC AAC AAA GGC CTC CCC 
TTC CAG AGC TTG ITT CCG GAG GGC 
Lys Val Ser Asn Lys Gly Leu Pro



1570 1580 

TCC TCC ATC CAG AAA ACC ATC TCC 
AGG AGG TAB CTC TTT TGG TAG ACC 
Ser Ser Ile Glu Lys Thr Ile Ser>



1590 1600 1610 1620 1630 

* * * * 

AAA CCC AAA GC TGG GAC CCA CGC GGT CCG AGG CCC ACA TGG ACA GAC CTC 
TTT CGC TTT CC ACC CTG GGT GCC CCA CCC TCC CGC TCT ACC TCT CTC CAG 
Lys Ala Lys> 



1640 1650 1660 1670 1680

* * * * * 

AGC TCC GCC CAC CCT CTC CCC TGG GAG TGA CCG CTG TCC CAA CCT CTC 
TCG AGC CCG GTG GGA CAC GGG ACC CTC ACT GGC GAC ACC GTT GGA GAC 

1690 1700 1710 1720 1730 

* • * • * 

TCC CTA CA CCG CAG CCC CGA GAG CCA CAG GTG TAC ACC CTC CCC CCA TCC 
AGG GAT CT CCC CTC CCG CCT CTC GGT GTC CAC ATG TGG GAC GGG GGT AGC 
Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser>

1740 1750 1760 1770 1780 



CAC 
GTC 
Gln


GAC 
CTC 
Glu 


* 

CAC 
CTC 
Glu 


ATG 
TAC 
Met 


ACC 
TCC 
Thr 


* 

AAC AAC 
TTC TTC 
Lys Asn


CAC 
CTC 
Gln


* 

GTC 
CAC 

Val


AGC CTC 
TCC GAC 
Ser Leu 


• 

ACC TGC 
TGC ACC 
Thr Cys 


CTC 
GAC 
Leu 


GTC 
CAC 
Val 


* 

AAA 

TTT 
Lys> 




1790 
• 






1800 
• 




1810 


1820 








GCC 
CCG 
Gly


TTC 
AAG 
Phe 


TAC 
ATC 

Tyr 


CCC 
GGG 

Pro 


ACC 
TCC 
Ser 


GAC ATC 
CTC TAG 
Asp Ile


CCC 
CCG 
Ala 


CTC 
CAC 
Val 


CAG TGG 
CTC ACC 
Glu Trp


GAC 
CTC 
Glu 


ACC 
TCC 
Ser 


AAT 
TTA 
Asn 


GGG 
CCC 
Gly


CAC 
CTC 
Gln> 


1830 
• 




1840 
• 




1850 
* 






1860




1870 






CCC 
GGC 
Pro 


GAG 
CTC 
Glu 


AAC 
TTC 
Asn 


AAC 
TTC 
Asn


TAC 
ATC 
Tyr 


AAG ACC 

TTC TCC 
Lys Thr 


ACC 

TGC 
Thr 


CCT 
CGA 

Pro 


CCC CTG 
GGG CAC 
Pro Val 


CTC 
GAC 
Leu 


GAC 
CTC 
Asp 


TCC 
AGC 
Ser 


CAC 
CTC 
Asp 


GGC 
CCC 
Gly>


1880 
• 






1890 
* 


1900 
<* 




1910 






1920 




TCC 
AGC 
Ser 


TTC 
AAC 
Phe 


TTC 
AAC 
Phe 


CTC 
CAC 
Leu 


TAC 
ATC 
Tyr 


AGC AGC 
TCC TCC 
Ser Arg 


CTA 
GAT 
Leu 


ACC 
TCG 
Thr 


CTG GAC 
CAC CTC 
Val Asp 


AAG 

TTC 
Lys 


AGC AGG 
TCC TCC 
Ser Arg 


TCC 
ACC 
Trp 


CAG 
CTC 
Gln> 



RECTIFIED SHEET (RULE 91) 
ISA/EP 



WO 96/40921 



PCT/US96/09287 



13/41 

FIG. 4 E 

1930 1940 1950 1960 1970 

• * • * # 

CAG CCC AAT CTC TTC TCA TCC TCC CTC ATG CAT GAG CCT CTC CAC AAC 
CTC CCC TTA CAG AAG ACT ACG AGG CAC TAC CTA CTC CGA CAC GTG TIC 
Glu Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn>

1980 1990 2000 2010 2020 

* - ♦ * * # 

CAC TAC ACA CAG AAG AGC CTC TCC CTG TCT CTC GGT AAA T GAG TCC CAG 
GTG ATG TCT CTC TTC TCC GAG AGC GAC AGA GAC CCA TTT A CTC ACG GTC 
His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Leu Gly Lys ***>

2030 2040 2050 2060 2070 

♦ * - * * * 

CCC CCC CAA CCC CCC CCT CCC CCC CCT CTC GGG GTC CCC CGA GGA TCC 

CCC CCC CTT CCG GGG CGA GGG GCC CGA GAC CCC CAG CGC CCT CCT ACG 

2080 2090 2100 2110 

* * * * 

TTG GCA CCT ACC CCC TCT ACA TAC TTC CCA CGC ACC CAG CAT CGA AAT 
AAC CCT GCA TOG GGC AGA TCT ATG AAG GGT CCC TCG GTC GTA CCT TTA 

2120 2130 2140 2150 2160 

* * * * + 

AAA CCA CCC ACC ACT GCC CTG CGC CCC TCT GAG ACT GTG ATG CTT CTT 
TTT CCT GGC TCG TGA CCC GAC CCC GGG ACA CTC TGA CAC TAC CAA CAA 

2170 2180 2190 2200 2210 

* * * * * 

TCC ACC GGT CAG CCC GAG TCT GAG GCC TGA GTG ACA TGA GGG AGG CAG 
AGC TCC CCA CTC CCC CTC ACA CTC CCC ACT CAC TCT ACT CCC TCC GTC 

2220 2230 2240 2250 2260 

* * * * # 

AGC GGC TCC CAC TCT CCC CAC ACT GGC CCA CGC TCT GCA GGT GTG CCT 
TCC CCC AGC GTG ACA GGG CTC TGA CCC GGT CCC ACA CCT CCA CAC GGA 

2270 2280 2290 2300 2310 

* * • • * 

CCC CCA CCT AGG GTG GGG CTC AGC CAG GGG CTC CCC TCG CCA GGG TCG 
CCC CCT GGA TCC CAC CCC GAG TCG GTC CCC GAC CGC ACC CCT CCC ACC 

2320 2330 2340 2350 

• * ♦ • 

GCC ATT TCC CAG CCT GGC CCT CCC TCC AGC AGC AGG ACT CTA GAG GAT 

CCC TAA ACC CTC CCA CCC GGA GGC AGG TCC TCC TCC TGA CAT CTC CTA 

2360 2370 2380 2390 2400 

• * * * * 

CAT AAT CAG CCA TAC CAC ATT TCT AGA GGT TIT ACT TCC TTT AAA AAA 
CTA TTA CTC GGT ATG CTC TAA ACA TCT CCA AAA TGA ACC AAA TTT TTT 

2410 2420 2430 2440 2450 

* * * * • 

CCT CCC ACA CCT CCC CCT CAA CCT GAA ACA TAA AAT GAA TCC AAT TCT 
CGA GCC TCT GGA GGC CCA CTT GGA CTT TCT ATT TTA CTT ACC TTA ACA 
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FIG. 4 F 

2460 2470 2480 2490 



2500 



TGT TCT TAA CTT CTT TXT TCC ACC TTA TAA TGG TTA CAA ATA AAC CAA 
ACA ACA ATT CAA CAA ATA ACG TCC AAT ATT ACC AAT CTT TAT TTC CTT 

2510 2520 2530 2540 2550

* 

TAG CAT CAC AAA TIT CAC AAA TAA ACC ATT TTT TTC ACT CCA TTC TAG 
ATC CTA CTC TTT AAA CTC TTT ATT TCC TAA AAA AAC TCA CCT AAC ATC 

2560 2570 2580 2590 

* * * * 

TTG TCC TTT CTC CAA ACT CAT CAA TCT ATC TTA TCA TOT CTC GAT CCT 
AAC ACC AAA CAG CTT TGA GTA CTT ACA TAG AAT ACT ACA CAC CIA CCA 

2600 2610 2620 2630 2640 

* * * * * 

CTA CCC CCG ACC CAT CCT CGC CGG CAT CAC CCC CCC CAC ACC TCC CCT 
GAT CCC CCC TCC GTA GCA CCC CCC GTA GTG CCC CCG CTC TCC ACC CCA 

2650 2660 2670 2680 2690 

* * ♦ * # 

TCC TGG CCC CTA TAT CGC CCA CAT CAC CCA TGG CCA AGA TCC CGC TCG 
ACG ACC, CCC GAT ATA CCG GCT GTA CTC CCT ACC CCT TCT ACC CCC ACC 

2700 2710 2720 2730 2740 

* * * ■ ♦ ' * 

CCA CTT CCC CCT CAT GAG CCC TTC TTT CCG CCT CGG TAT CCT CGC ACG 
GGT CAA CCC CCA GTA CTC CCC AAC AAA CCC CCA CCC ATA CCA CCG TCC 

2750 2760 2770 2780 2790 

♦ ■ * * * « 

CCC CTC CCC CGG GCA CTC TTG CCC CCC ATC TCC TTG CAT GCA CCA TTC 
GGG CAC CCG CCC CCT GAC AAC CCG CCG TAG ACG AAC GTA CCT GGT AAC 

2800 2810 2820 2830 

* • *■ * 

CTT CCC CCC CCG CTC CTC AAC CCC CTC AAC CTA CTA CTC CCC TCC TTC 
CAA CCC CGC CCC CAC GAG TTC CCG GAG TTC GAT GAT GAC CCC ACC AAC 

2840 2850 2860 2870 2880 

* * ♦ • * 

CTA ATC CAG GAG TCC CAT AAC CCA GAG CCT CCA CCT CCC CCC GCG TTG 
GAT TAC CTC CTC ACC GTA TTC CCT CTC CCA CCT CCA CCC CCC CCC AAC 

2890 2900 2910 2920 2930 

• * • . ♦ * 

CTC CCC TTT TTC CAT ACC CTC CCC CCC CCT GAC GAG CAT CAC AAA AAT 
GAC CCC AAA AAC GTA TCC GAC GCG GCG CCA CTC CTC CTA CTC TIT TTA 

2940 2950 2960 2970 2980 

♦ • • • - * 

CCA CCC TCA ACT CAC AGG TGG CCA AAC CCC ACA GCA CTA TAA ACA TAC 
CCT CCC ACT TCA CTC TCC ACC CCT TTG CCC TCT CCT GAT ATT TCT ATG 
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FIG. 4 G 

2990 3000 3010 3020 3030 
* 

CAC GCC TIT CCC CCT CCA ACC TCC CTC CTC CGC TCT CCT CTT CCC ACC 

CTC CCC AAA CCC CCA CCT TCC ACC GAG CAC CCC AGA CCA CAA CCC TCC 

3040 3050 3060 3070 

* * * * 

CTG CCC CTT ACC CCA TAC CTC TCC CCC TTT CTC CCT TCC CCA ACC CTC 

CAC CCC CAA TCC CCT ATC CAC ACC CCC AAA GAG CCA ACC CCT TCC CAC 

3080 3090 3100 3110 3120 

* 

GCC CTT TCT CAA TCC TCA CCC TCT ACC TAT CTC ACT TCC CTC TAC CTC 
CCC CAA AGA CTT ACC AGT CCC ACA TCC ATA GAG TCA ACC CAC ATC CAC 

3130 3140 3150 3160 3170

* * • 

CTT CCC TCC AAG CTG CGC TCT GTC CAC GAA CCC CCC CTT CAC CCC rxn 
CAACCCACCTTCCACCCGACACACCTCCTTGCCGCCcS 

3180 3190 3200 3210 3220

CCC TGC CCC TTA TCC CGT AAC TAT CCT CCT GAG TCC AAC CCC cm Irsx 
CCG ACC CCG AAT ACC CCA TIG ATA CCA GAA CTC ATC TTG CGC CAT TCT 



3230 3240 3250 



3260 3270 



CAC CAC TTA TCC CCA CTG GCA CCA GCC ACT CGT AAC ACC XTT xnn 
CTG CTG AAT ACC GGT GAC CCT CCT CCC TCA CCA TTG TCC TAA TCC TCT 

3280 3290 3300 3310 

GCC ACC TAT CTA CGC CCT GCT ACA GAG TTC TTG AAG TCC TCC CCT AAC 
CCC TCC ATA CAT CCC CCA CCA TCT CTC AAG AAC TTC ACC ACC CGA TTC 

3320 3330 3340 3350 3360 

* * • # * 

TAC CCC TAC ACT ACA ACC ACA CTA TTT CCT ATC TCC CCT CTC CTG AAC 
ATC CCC ATC TCA TCT TCC TCT CAT AAA CCA TAG ACC CCA CAC GAC TTC 

3370 3380 3390 3400 3410 

* 

CCA CTT ACC TTC CGA AAA AGA CTT GGT ACC TCT TCA TCC CCC AAA CAA 
CCT CAA TCG AAG CCT TTT TCT CAA CCA TCC AGA ACT ACC CCC TTT CTT 

3420 3430 3440 3450 3460

ACC ACC CCT CCT ACC CGT CCT TTT TTT CTT TGC AAG CAG CAG ATT ACC 
TGG TCG CGA CCA TCG CCA CCA AAA AAA CAA ACC TTC CTC CTC TAA TGC 

3470 3480 3490 3500 3510

♦ ♦ 

CCC ACA AAA AAA GGA TCT CAA GAA GAT CCT TTG ATC TTT TCT ACC CGC 
GCC TCT TTT TTT CCT AGA CTT CTT CTA GGA AAC TAC AAA AGA TCC CCC 
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FIG. 4 H 



3520 3530 3540 3550 

!5F 00 ^ ** C °* ™» TAA CCC ATT TIG CTC ATC 

ACA CTC CCA GTC ACC TTC CTT TTC ACT CCA ATT CCC TAX AAC CAC TAC 

3560 3570 3580 3590 3600

AOL TTA TCA AAA AGO ATC TTC ACC TAG ATC CTT TTA AAT TaI AAA TCA 
TCT AAT ACT TTT TCC TAG AAG TCC ATC TAG CAA AAT TTA" ATT TTT IS 

3610 3620 3630 3640 3650

• • • 

JST TIT Z£ V* ATC TA * ^ ATA TXT ^ ™ *« TCT CAC ACT 

TCA AAA TTT ACT TAG ATT TCA TAT ATA CTC ATT TCA ACC ACA CTC TCA 



3660 3670 3680 3690 



3700 



I^^^™ ATC ^^^ OTATCT »^ATCTCTCTATTr 
ATC CTT ACC AAT TAG TCA CTC CGT CCA TAG ACT CCC TAG ACA GAT AAA 



3710 3720 3730 3740 



3750 



CGT TCA TCC ATA CTT CCC TCA CTC CCC GTC GTC TAG ATA ACT ACC xJi 
CCA ACT AGG TAT CAA CGG ACT GAG CCC CAC CAC ATC TAT TCA TCC TAT 



3760 3770 3780 



3790 



^^^^^^^^^^^ATGATACCCCCACAC 
CTC CCC AAT CGT ACA CCC CGG TCA CCA CGT TAC TAT GGC OCT CTC 

3800 3810 3820 3830 3840

* * * 

aacxKTaccGOTcatMmTaoamjuurcMmcccBa 

GOT CCC AGT GGC CCA GOT CTA AAT ACT COT TAT TTC GTC GOT CCC CCT 

3850 3860 3870 3880 3890

AGG GCC CAC CCC ACA AGT GGT CCT CCA ACT TTA TCC CCC TCC ATC CXC 



3900 3910 3920 3930 



* 



3940 



TCT ATT AAT TCT TCC CCC CAA CCT ACA CTA AGT AGT TCB CCX rw >\. 

acataattaacaacccccotcgatctcmtS 

3950 3960 3970 3980 3990

* • . 

^I^^iJf^ OT ^ AWOTAaWATC ™^TCACCC 
TCA AAC CCC TTC CAA CAA CCC TAA CCA TCT CCC TAG CAC CAC ACT CCC 



4000 4010 4020 



4030 



TCC TCC TTT CGT ATC CCT TCA TTC ACC TCC CGT TCC CAA CCA TCA ACC 
AGC ACC AAA CCA TAC CCA ACT AAC TCC AGG CCA AGG CTT CCT ACT TCC 
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FIG. 4 I 



4040 4050 4060 4070 4080 

• * * • « 

CCA CTT ACA TCA TCC CCC ATC TTC TCC AAA AAA CCC CTT ACC TCC TTC 
OCT CAA TCT ACT ACS CCC TAC AAC ACC TTT TIT CCC CAA TCC ACC AAG 

4090 4100 4110 4120 4130

• * * * ' . • 

CCT CCT CCC ATC CTT CTC ACA ACT AAG TTC CCC CCA GTC TTA TCA CTC 
CCA CCA CCC TAC CAA CAC TCT TCA TTC AAC CCC CCT CAC AAT ACT GAG 

4140 4150 4160 4170 4180 

# • * * # 

ATC CTT ATC CCA CCA CTC CAT AAT TCT CTT ACT CTC ATC CCA TCC GTA 
TAC CAA TAC CCT CCT CAC CTA TTA ACA CAA TCA CAO TAC CCT ACC CAT 

4190 4200 4210 4220 4230 

• • * * * 

ACA TCC TTT TCT CTC ACT CCT CAC TAC TCA ACC AAC TCA TTC TCA CAA 
TCT ACC AAA ACA CAC TCA CCA CTC ATC ACT TCC TTC ACT AAC ACT CTT 

4240 4250 4260 4270 

* * * * 

TAG TCT ATC CCC CCA CCC ACT TCC TCT TCC CCC COG TCA ACA CCC CAT 
ATC ACA TAC CCC CCT CCC TCA ACQ ACA ACQ CCC CCC ACT TCT CCC CIA 

4280 4290 4300 4310 4320

• .* * * * ' 

AAT ACC GCG CCA CAT ACC ACA ACT TTA AAA GTC CTC ATC ATT GGA AAA 
TTA TCG CCC GCT GTA TCC TCT TCA AAT TTT CAC GAG TAG TAA CCT TTT 

4330 4340 4350 4360 4370 

* * * * * 

CCT TCT TCG GCG CGA AAA CTC TCA ACG ATC TTA CCG CTC TTG ACA TCC 
CCA ACA AGC CCC GCT TTT GAG ACT TCC TAG AAT CCC GAC AAC TCP AGG 

4380 4390 4400 4410 4420 

* * # * * 

ACT TCG ATC TAA CCC ACT CCT CCA CCC AAC TCA TCT TCA CCA TCT TIT 
TCA AGC TAC ATT GCG TCA CCA CCT GCG TTG ACT ACA ACT CCT ACA AAA 

4430 4440 4450 4460 4470 

* * * # * 

ACT TTC ACC ACC CTT TCT CGC TCA CCA AAA ACA CGA ACG CAA AAT CCC 
TCA AAC TCG TCC CAA ACA CCC ACT CCT TTT TCT CCT TCC GTT TIA CCG 

4480 4490 4500 4510 

* # - * • * 

CCA AAA AAG CCA ATA ACC CCC ACA CCG AAA TCT TCA ATA CTC ATA CTC 
OCT TTT TTC CCT TAT TCC CCC TCT CCC TTT ACA ACT TAT GAG TAT GAG 

4520 4530 4540 4550 4560 

• • • * • 

TTC CTT TTT CAA TAT TAT TGA ACC ATT TAT CAC CCT TAT TCT CTC ATC 
AAG CAA AAA CTT ATA ATA ACT TCC TAA ATA GTC CCA ATA ACA GAG TAC 
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FIG. 4 J 

4570 4580 4590 4600 4610 

* * 

AGC CGA TAC ATA TTT CAA TCT ATT TAG AAA AAT AAA CAA ATA CCC GTT 
TCC CCT ATC TAT AAA CTT ACA TAA ATC TTT TTA TTT CTT TAT CCC CAA 



4620 4630 4640 4650 



4660 



CCC CCC ACA TTT CCC CCA AAA GTG CCA CCT CAC CTC TAA CAA ACC ATT 

CCC CCC TCT AAA CCC CCT TTT CAC CCT CCA CTC CAC ATT CTT TCC TAA 

4670 4680 4690 4700 4710 

ATT ATC ATC ACA TTA ACC TAT AAA AAT ACC CCT ATC ACC ACC CCC TCA 

TAA TAG TAC TCT AAT TCC ATA TTT TTA TCC CCA TAG TCC TCC CCC ACT 

4720 4730 4740 4750 

* * • 

TCC CTC TTT CCC CCA CCC ATC CTT CCT AAT CTT CCC TCC CAC CCA CCA 

ACC GAG AAA CCC CCT CCC TAG CAA CCA TTA CAA CCC ACC CTC CCT CCT 



4760 4770 4780 4790 



4800 



— m 

CAA CCC TCA AGA CAA AAT CTA ATC ACA CTC CCT CAC CTT CCC CTC CGC 

CTT CCC ACT TCT CTT TTA CAT TAG TCT GAC CCA CTC CAA CCC CAC CCC 

4810 4820 4830 4840 4850

♦ * 

CTT TCT CCC TTT ATA AGG ACA CAC TTT ATC TTT AAC AAG CTT CCT AAA 
CAA AGA CCC AAA TAT TCC TCT CTC AAA TAC AAA TTC TTC CAA CCA TTT 



4860 4870 4880 4890



4900 



TTC CTT CCC OCT TTC GCA CCC AAC CTA GAG ATC TCT AGC TTC Gin TT\ 

4910 4920 4930 4940 4950

* * * " 

TCC TCC CAC TCA CCT CAC TEA TTA TTT TAC ACA CAA ACA CCC TTT ATC 

4960 4970 4980 4990 

^ir?^^ TTO ^^^^^^^CCCOTATACTC 
CGC AAA ACT CTA AAG ACA GCG GCT GAT TTA ACT ACA GCG CGC TAT CAC 

5000 5010 5020 5030 5040 

CTC TTT ATC GCC GAT AGA GAT GCC GAT ATT CCA AAA ATC CAT ~* 
CAC AAA TAG CCC CTA TCT CTA CCC CTA TAA CCT TTT TAG CTA TAA ACT 

5050 5060 5070 5080 5090 

* * • • * 

AAA TAT GCC ATA TTC AAA ATC TCC CCG ATC TCA GTT TCT CTC TAA CTC 
TTT ATA CCG TAT AAC TTT TAC AGC CGC TAC ACT CAA ACA CAC ATT GAC 
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FIG. 4 K 

5100 5110 5120 5130 5140 

• • * * # 

ATA TCC' CCA TTT TTC CAA AAG TGA TTT TTC CGC ATA CCC CAT ATC TGG 
TAT ACC CGT AAA AAC CTT TTC ACT AAA AAC CCC TAT CCC CTA TAG ACC 

5150 5160 5170 5180 5190 

♦ * • • * # 

CCA TAG CCC TTA TAT CCT TTA CCC CCC ATC CCC ATA GAC GAC TTT CGT 
CCT ATC CCC AAT ATA CCA AAT CCC CCC TAC CCC TAT CTC CTC AAA CCA 

5200 5210 5220 5230 

• * * • 

GAC TIG CGC GAT TCT CTC TGT CCC AAA TAT CGC ACT TTC GAT ATA GGT 
CTC AAC CCC CTA AGA CAC ACA CCC TTT ATA CCC TCA AAG CTA TAT CCA 

5240 5250 5260 5270 5280 

* • ■ • • • 

GAC AGA CCA TAT GAG CCT ATA TCC CCG ATA GAC CCC ACA TCA ACC TGG 
CTC TCT CCT ATA CTC CCA TAT ACC CGC TAT CTC CCC TCT ACT TCC ACC 

5290 5300 5310 5320 5330 

• * • * * 

CAC ATC GCC AAT GCA TAT CCA TCT ATA CAT TCA ATC AAT ATT CGC CAT 
CTC TAC CCC TTA CCT ATA CCT AGA TAT CTA ACT TAG TTA TAA CCC CTA 

5340 5350 5360 5370 5380

• • • • • 

TAG CCA TAT TAT TCA TTC CTT ATA TAG CAT AAA TCA ATA TTC CCT ATT 
ATC CGT ATA ATA ACT AAC CAA TAT ATC CTA TTT ACT TAT AAC CCA TAA 

5390 5400 5410 5420 5430 

• * ■ • • * 
CGC CAT TCC ATA CCT TGT ATC CAT ATC ATA ATA TCT ACA TTT ATA TTC 
CCC CTA ACC TAT GCA ACA TAG GTA TAG TAT TAT ACA TGT AAA TAT AAC 

5440 5450 5460 5470 

* * * • 

GCT CAT CTC CAA CAT TAC CGC CAT CTT GAC ATT GAT TAT TGA CTA CTT 
CGA GTA CAG GTT GTA ATG GCG CTA CAA CTC TAA CTA ATA ACT GAT CAA 



5480 5490 5500 5510 



5520 



ATT AAT ACT AAT CAA TTA CGC GGT CAT TAG TTC ATA GCC CAT ATA TGG 
TAA TEA TCA TTA GTT AAT GCC CCA CTA ATC AAG TAT CGC GTA TAT ACC 

5530 5540 5550 5560 5570 

* * * * 

ACT TCC GCG TTA CAT AAC TTA CGC TAA ATG GCC CCC CTC GCT GAC CGC 
TCA AGG CGC AAT CTA TTG AAT GCC ATT TAC CCG GCC GAC CCA CTG GCC 



5580 5590 5600 5610 



5620 



CCA ACC ACC CCC GCC CAT TGA CGT CAA TAA TGA CCT ATG TTC CCA TAG 
CGT TGC TGG GGG CGG CTA ACT CCA CTT ATT ACT CCA TAC AAG GCT ATC 
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FIG. 4 L 



5630 5640 5650 5660 5670 

. • * , * * * 

TAX CCC CAA TAG CGA CTT TCC ATT GAC GTC AAT COG TCG ACT ATT TAG 
ATT GCG GTT ATC CCT GAA AGG TAA CTC GAG TTA CCC ACC TCA TAA ATC 

5680 5690 5700 5710 

* * ♦ • 

GGT AAA CTC CCC ACT TCG CAG TAG ATC AAG TGT ATC ATA TGC CAA GTA 
CCA TTT GAC GCG TCA ACC GTC ATG TAG TTC ACA TAG TAT ACC GTT CAT 

5720 5730 5740 5750 5760 

* * • ♦ * 

CGC CCC CTA TTC ACC TCA ATG ACC GTA AAT GCC CCC CCT CGC ATT ATC 
GCG GCG CAT AAC TGC ACT TAC TCC CAT TTA CCG GCC CGA CCC TAA TAC 

5770 5780 5790 5800 5810

• ' • * * . * 

CCC ACT ACA TCA CCT TAT GGG ACT TTC CTA CTT GGC ACT ACA TCT ACC 
GGG TCA TCT ACT CGA ATA CCC TCA AAG GAT GAA CCG TCA TCT AGA TGC 

5820 5830 5840 5850 5860 

• * * * • 

TAT TAG TCA TCC CTA TTA CCA TCC TCA TGC GGT TTT GGC ACT ACA TCA 
ATA ATC ACT ACC CAT AAT GGT ACC ACT ACC CCA AAA CCC TCA TGT ACT 

5870 5880 5890 5900 5910 

* * * * • 

ATC GCC GTC GAT ACC GGT TTC ACT CAC GCC GAT TTC CAA GTC TCC ACC 
TAC CCC CAC CTA TCC CCA AAC TGA GTG CCC CTA AAG GTT CAG AGG TCG 

5920 5930 5940 5950 

* * * * 

CCA TTC ACG TCA ATG GGA GTT TGT TTT CCC ACC AAA ATC AAC GGG ACT 
GGT AAC TGC ACT TAC CCT CAA ACA AAA CCC TCG TTT TAG TTC CCC TCA 

5960 5970 5980 5990 6000 

* • * * * 

TTC CAA AAT GTC CTA ACA ACT CCC CCC CAT TCA CCC AAA TCC GCC CTA 
AAG GTT TTA CAC CAT TCT TGA GGC GGG GTA ACT GCG TTT ACC CGC CAT 

6010 6020 6030 6040 6050 

* • * * * 

GGC GTG TAC GGT GGG AGG TCT ATA TAA GCA GAG CTC GTT TAG TGA ACC 
CCG CAC ATG CCA CCC TCC AGA TAT ATT CCT CTC GAG CAA ATC ACT TCG 

6060 6070 6080 6090 6100 

* ♦ * * * 

GTC AGA TCG CCT CGA GAC GCC ATC CAC GCT GTT TTC ACC TCC ATA GAA 
CAG TCT AGC GGA CCT CTG CGG TAG GTG CGA CAA AAC TCG AGG TAT CTT 

6110 6120 6130 6140 6150 

* * * * < * 

GAC ACC GGG ACC GAT CCA GCC TCC GCG GCC GGC AAC GGT GCA TTC GAA 
CTG TCG CCC TGG CTA GGT CGG ACQ CGC CGC CCC TTC CCA CCT AAC CTT 
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FIG. 4 M 

6160 6170 6180 6190 

* • ♦ * 

CCC GGA TTC CCC CTC CCA AGA GTG ACQ TAX GTA CCC CCT ATA GAG TCT 
GCG CCT AAG GGG CAC CCT TCT CAC TCC ATT CAT CCC CCA TAT CTC AGA 

6200 6210 6220 6230 6240 

* * * * # 

ATA CCC CCA CCC CCT TCC CTT CTT ATC CAT CCT ATA CTC TTT TTC CCT 
TAT CCC CCT CCC GGA ACC GAA GAA TAC GTA CGA TAT CAC AAA AAC CCA 

6250 6260 6270 6280 6290

* • * * * 

TCC CCT CTA TAC ACC CCC CCT TCC TCA TCT TAT ACC TGA TCC TAT ACC 
ACC CCA GAT ATC TCC CCC CGA AGC ACT ACA ATA TCC ACT ACC ATA TCC 

6300 6310 6320 6330 6340 

• * * * • 

TTA CCC TAT ACC TCT CCC TEA TTC ACC ATT ATT CAC CAC TCC CCT ATT 
AAT COG ATA TCC ACA CCC AAT AAC TCC TAX TAA CTC CTC AGC CGA TAA 

6350 6360 6370 6380 6390

• # * * * 

CCT CAC GAT ACT TTC CAT TAC TAA TCC ATA ACA TCC CTC TTT CCC ACA 
CCA CTC CTA TCA AAC GTA ATC ATT ACC TAT TCT ACC GAG AAA CCC TCT 

6400 6410 6420 6430 

* * * * 

ACT CTC TTT ATT CCC TAT ATC CCA ATA CAC TCT CCT TCA GAG ACT CAC 
TCA GAG AAA TAA CCC ATA TAC GOT TAT CTC ACA CCA ACT CTC TGA CTC 

6440 6450 6460 6470 6480 

• • • • • 

ACC GAC TCT CTA TTT TTA CAG CAT CCC CTC TCA TTT ATT ATT TAC AAA 
TCC CTC AGA CAT AAA AAT CTC CTA CCC CAG ACT AAA TAA TAA ATC TTT 

6490 6500 6510 6520 6530 

* # • * * 

TTC ACA TAT ACA ACA CCA CCC TCC CCA GTG CCC CCA CTT TTT ATT AAA 
AAC TCT ATA TCT TCT CCT CGC AGG CCT CAC GGG CCT CAA AAA TAA TTT 

6540 6550 6560 6570 6580 

* * * * • 

CAT AAC CTC GGA TCT CCA CGC GAA TCT CCC CTA OCT CTT CCC GAC ATC 
GTA TTC CAC CCT AGA CCT GCG CTT AGA CCC CAT CCA CAA CCC CTC TAC 

6590 6600 6610 6620 6630 

* * * * • 

GGC TCT TCT CCC CTA CCC GCG GAC CTT CTA CAT CCG AGC CCT CCT CCC 
CCG AGA AGA GGC CAT CCC CGC CTC GAA GAT GTA GGC TCG GGA CGA GGG 

6640 6650 6660 6670 

• # • * 

ATC CCT CCA GCG ACT CAT GGT CGC TCG CCA CCT CCT TCC TCC TAA CAG 
TAC GGA GGT CGC TCA GTA CCA GCG AGC CCT GGA GGA ACC AGG ATT CTC 
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FIG. 4 N 



6680 6690 6700 6710 6720 

♦ • ' . ♦ * * 

TGG ACG CCA GAC TTA CCC ACA GCA CCX TCC CCX CCX CCX CCA CTC TCC 
ACC TCC GOT CTG AAT CCC TGT CGT GCT ACG CCT GOT GGT GOT CAC ACG 

6730 6740 6750 6760 6770

• * * * * 

CGC ACA AGG CCC TGG CGG TAG GGT ATO TGT CTC AAA ATG ACC TCG GGC 
GCG TGT TCC CGC ACC CCC ATC CCA TAC ACA GAC TTT TAC TCG ACC CCC 

6780 6790 6800 6810 6820

* ♦ * * * 

ACC CGG CTT CCA CCC CTG ACG CAT TTG GAA GAC TTA ACG CAC CCC CAG 
TCG CCC GAA CCT GGC GAC TGC CTA AAC CTT CTG AAT TCC CTC GCC CTC 

6830 6840 6850 6860 6870 

* * • * * 

AAC AAC ATC CAG CCA CCT CAC TTC TTG TGT TCT GAT AAC ACT CAG AGG 
TTC TTC TAC CTC CCT CCA CTC AAC AAC ACA AGA CTA TTC TCA CTC TCC 

6880 6890 6900 6910 

* * * * 

TAA CTC CCC TTC CCC TCC TGT TAA CCC TGG AGG CCA CTC TAC TCT GAG 
ATT CAC CCC AAC CCC ACC ACA ATT CCC ACC TCC CCT CAC ATC ACA CTC 

6920 6930 6940 6950 6960 

• * * • - * 

CAC TAC TCC TTG CTC CCC CCC CCC CCA CCA CAC ATA ATA CCT CAC AGA 
CTC ATC ACC AAC CAC CCC CCC CCC CCT CCT CTC TAT TAT CCA CTC TCT 

6970 6980 6990 7000 7010

• * * * ♦ 

CTA ACA CAC TCT TCC TTT CCA TCC CTC TTT TCT CCA CTC ACC CTC CTT 
CAT TCT CTC ACA ACC AAA CCT ACC CAC AAA ACA CCT CAC TCC CAC GAA 

7020 7030 7040 7050 7060 

* • * * • 

GAC ACC AAC CTT CGC CTC CAC CTC CAT CCA CTC TAG AGG ATC GAT CCC 
CTG TGC TTC GAA CCC GAC CTC CAC CTA GCT GAG ATC TCC TAC CTA GGG 

7070 

CGC GCG ACC TC 
CCC CCC TCC AC 
FIG. 5 A

The pEe12TF8LCDR3 expression vector DNA sequence. The coding regions of the TF8-5G9 CDR-grafted LC gene, TF8LCDR3, are translated.

Sequence Range: 1 to 7864 

10 20 30 40 50 

* * • ♦ * 

AAT TCA CC ATG GCT CTC CCA ACT CAC CTA TEA GGA TTA CTG CTC CTG TCC 
TTA ACT GC TAC CCA CAC GCT TGA GTC CAT AAT CCT AAT GAC CAC GAC ACC 
Met Gly Val Pro Thr Gln Val Leu Gly Leu Leu Leu Leu Trp>

60 70 80 90 



CTT 
CAA 
Leu 


ACA 
TGT 
Thr 


CAT 
CTA 
Asp 


* 

CCA 
CCT 
Ala 


ACA 
TCT 
Arg 


TCT 
ACA 

Cys


* 

GAT 
CTA 
Asp 


ATC 
TAG 
Ile


CAA 
GTT 
Gln


» 

ATC 
TAC 
Met 


ACA 
TCT 
Thr 


CAA 
GTT 
Gln


TCT 
ACA 
Ser 


* 

CCT TCT 
GCA ACA 
Pro Ser 


TCT 
ACA 
Ser> 


100
* 






110 
• 






120 




130 






140 






CTA 
GAT 
Leu 


ACT 
TCA 
Ser 


CCT 
CCA 
Ala 


TCT GTC 
ACA CAC 
Ser Val 


CCA 
CCT 
Gly 


GAT 
CTA 
Asp 


ACA 
TCT 
Arg 


CTA 
CAT 
Val 


ACA 
TCT 
Thr 


ATT 
TAA 
Ile


ACA 

TGT 
Thr 


TGT 
ACA 
Cys


AAC CCC 
TTC CCC 
Lys Ala


ACT 
TCA 
Ser> 


150 
* 




160 






170 






180 




190 




CAG 
CTC 
Gln


GAC 
CTG 
Asp 


ATT 
TAA 
Ile


ACA 
TCT 
Arg 


AAC 
TTC 
Lys 


TAT 
ATA 
Tyr 


TXA 
AAT 
Leu 
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* 






330 






ATT 
TAA 
Ile


TCT 
ACA 
Ser 


TCT 
ACA 
Ser 


CTC 
GAC 
Leu 


CAA 

CTT 
Gln


CCT 
CCA 

Pro 


GAC 
CTC 
Glu 


GAC 
CTC 
Asp 


ATT 
TAA 
Ile


GCT 
CCA 
Ala 


ACA 
TCT 
Thr 


TAC 
ATC 
Tyr 


TAC 
ATG 
Tyr 


TCC 
ACC 
Cys 


CTA 
GAT 
Leu 


CAA 

GCT 
Gln> 



340 350 360 370 380 







• 








* 






* 






• 




CAT GCT 


CAC 


ACT 


CCC 


TAT 


ACA. 


TTT 


CCA 


CAA 


GGA 


ACA 


AAA 


CTA GAC 


ATC 


CTA CCA 


CTC 


TCA 


ccc 


ATA 


TCT 


AAA 


CCT 


CTT 


CCT 


TCT 


TTT 


CAT CTC 


TAC 


His Gly


Glu 


Ser 


Pro 


Tyr 


Thr 


Phe 


Gly 


Gln Gly


Thr 


Lys 


Leu Glu 


Ile> 


390 




400 






410 






420 




430 










• 






* 












* 




ACA ACA 


ACT 


CTT 


ccc 


ccc 


CCC 


TCT 


CTC 


TTC 


ATC 


TOC 


CCC 


CCA TCT 


GAT 


TGT TCT 


TGA 


CAA 


ccc 


ccc 


CCC 


ACA 


CAS 


AAC 


TAG 


AAG 


CCC 


CCT ACA 


CTA 


Thr Arg 


Thr 


Val 


Ala 


Ala 


Pro 


Ser 


Val




Ile


Phe 


Pro 


Pro Ser 


Asp> 



FIG. 5 B 



440 450 460 470 480

* * • * ♦ 

GAG CAG TTG AAA TCT GGA ACT CCC TCT CTT CTG TGC CTG CTC AAT AAC 
CTC GTC AAC TTT AGA CCT TGA CCG AGA CAA CAC ACC GAC GAC TTA TTG 
Glu Gln Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn>

490 500 510 520 530 

♦ * * ♦ * 
TTC TAT CCC AGA GAG GCC AAA GTA CAC TGG AAC GTG GAT AAC CCC CTC 
AAG ATA CGG TCT CTC CCG TTT CAT GTC ACC TTC CAC CTA TTG CGC GAC 
Phe Tyr Pro Arg Glu Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu>

540 550 560 570 

♦ * ♦ * 

CAA TCC GGT AAC TCC CAG GAG ACT GTC ACA GAG CAG GAC ACC AAG GAC 
GTT AGC CCA TTG AGG GTC CTC TCA CAG TGT. CTC GTC CTG TCC TTC CTG 
Gln Ser Gly Asn Ser Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp>

580 590 600 610 620 

♦ * * * * 

AGC ACC TAG AGC CTC AGC AGC ACC CTG ACC CTG ACC AAA GCA GAC TAC 
TCG TGC ATC TCC GAG TCC TCC TGG GAC TCC GAC TCC TTT CCT CTG ATC 
Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr>

630 640 650 660 670

* * * • * 

GAG AAA CAC AAA CTC TAC CCC TGC GAA GTC ACC CAT CAG CCC CTC ACC 
CTC TTT GTG TTT CAC ATC CCG ACC CTT CAG TGG CTA CTC CCC GAC TCC 
Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser>

680 690 700 710 720 

• * * * * 

TCG CCC CTC ACA AAG AGC TTC AAC AGG GGA GAG TCT T AGA CGG AGA ACT 
AGC CGG CAG TGT TTC TCG AAG TTC TCC CCT CTC ACA A TCT CCC TCT TCA 
Ser Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys>

730 740 750 760 770 

• • * • * 

CCC CCC ACC TCC TCC TCA GTT CCA GCC TCC GGA TCA TAA TCA CCC ATA 
CCG GGG TCG ACC AGG ACT CAA CCT CCC ACC CCT ACT ATT ACT CGC TAT 

780 790 800 810

• * * * 

CCA CAT TTC TAG AGG TTT TAC TTG CTT TAA AAA ACC TCC CAC ACC TCC 
GGT GTA AAC ATC TCC AAA ATC AAC GAA ATT TTT TGC ACC GTC TCG ACC 

820 830 840 850 860 

* • • * • 

CCC TGA ACC TCA AAC ATA AAA TCA ATC CAA TTC TTC TTC TTA ACT TCT 
CCG ACT TGG ACT TTC TAT TTT ACT TAC GTT AAC AAC AAC AAT TGA ACA 

870 880 890 900 910 

* > • * * * 

TTA TTC CAG CTT ATA ATC GTT ACA AAT AAA GCA ATA GCA TCA CAA ATT 
AAT AAC GTC CAA TAT TAC CAA TCT TTA TTT CCT TAT CCT ACT CTT TAA 



FIG. 5 C 



920 930 940 950 960 

* * * * t * 
TCA CAA ATX AAC CAT TTT TIT CAC TCC ATT CTA CTT CTG GIT TCT CCA 
ACT CTT TAT TTC CTA AAA AAA CTC ACQ TAA CAT CAA CAC CAA ACA CCT 

970 980 990 1000 1010

* * * • 

AAC TCA TCA ATG TAT CTT ATC ATG TCT CCA TCC TCT ACC CCG CAC CCA 
TTG ACT AGT TAC ATA GAA TAG TAC ACA CCT ACC ACA TGC CCC CTG OCT 

1020 1030 1040 1050 

* * * • 

TCC TCG CCC CCA TCA CCC CCC CCA CAC CTC CCC TTG CTG CCC CCT ATA 
AGC ACC CCC CCT ACT GGC CCC GCT CTC CAC CCC AAC CAC CCC CGA TAT 

< 

1060 1070 1080 1090 1100

* '* * * ♦ 

TCC CCG ACA TCA CCC ATG GGC AAC ATC CGC CTC CCC ACT TCG CCC TCA 
ACC GGC TGT AGT CGC TAC CCC TTC TAG CCC GAG CCG TCA ACC CCG ACT 

1110 1120 1130 1140 1150 

* * * * ♦ 

TGA CCG CTT CTT TCC CCG TGC CTA TGC TGC CAC CCC CCT CGC CGC GGC 
ACT CCC GAA CAA ACC CCC ACC CAT ACC ACC CTC CCC CCA CCC CCC CCC 

1160 1170 1180 1190 1200

* * • * • * 

ACT CTT CCG CCC CAT CTC CTT CCA TCC ACC ATT CCT TCC CCC CGC CCT 
TGA CAA CCC CCG CTA CAC GAA CCT ACC TGC TAA CGA ACC CCC CCG CCA 

1210 1220 1230 1240 1250 

• • * * * 

CCT CAA CCC CCT CAA CCT ACT ACT GGC CTC CTT CCT AAT CCA GGA CTC 
CCA CTT CCC GGA CTT CGA TCA TCA CCC CAC CAA CGA TTA CCT CCT CAC 

1260 1270 1280 1290 

♦ • * * 

CCA TAA CGC ACA CCC TCG ACC TCC GGC CGC CTT CCT CCC CTT TTT CCA 
CCT ATT CCC TCT CCC ACC TCC ACC CCC CCC CAA CCA CCG CAA AAA CCT 

1300 1310 1320 1330 1340 

• • • • * 

TAC CCT CCC CCC CCC TCA CCA CCA TCA CAA AAA TCG ACC CTC AAC TCA 
ATC CCA CGC CCG CCC ACT GCT CCT ACT CTT TTT ACC TCC GAG TTC ACT 

. 1350 1360 1370 1380 1390 

* * • * * 

GAG CTC CCC AAA CCC CAC ACC ACT ATA AAC ATA CCA CCC CTT TCC CCC 
CTC CAC CCC TTT CCC CTC TCC TGA TAT TTC TAT CCT CCC CAA ACG CCC 

1400 1410 1420 1430 1440 

• • * * • 

TGC AAC CTC CCT CCT CCC CTC TCC TCT TCC CAC CCT CCC CCT TAC CCC 
ACC TTC GAG CCA CCA CCC GAG ACC ACA ACC CTG CCA CCG CGA ATG CCC 

1450 1460 1470 1480 1490 

* • * * * 

ATA CCT CTC CCC CTT TCT CCC TTC GGC AAC CCT CGC GCT TTC TCA ATG 
TAT GGA CAC CCC GAA ACA CGC AAG CCC TTC CCA CCC CCA AAC ACT TAC 
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FIG. 5 D 



1500 1510 1520 1530 

• • • • 

CTC XCC CTC TAG CTA TCT CAC TTC GCT GTA CGT CCT TCC CTC CAA OCT 
GAG TCC GAC ATC CAT AGA CTC AAC CCA CAT CCA GCA AGC GAG GTT CCA 

1540 1550 1560 1570 1580 

* * ' * • * 

GGG CTG TCT GCA CGA ACC CCC CCT TCA GCC CCA CCG CTC CCC CTT ATC 
CCC GAC ACA CGT CCT TCG GGG GCA ACT CCG GCT CCC GAC CCC GAA TAG 

1590 1600 1610 1620 1630 

* * * * * 

CGG TAA CTA TCG TCT TCA CTC CAA CCC GGT AAC ACA CGA CTT ATC GCC 
CCC ATT GAT AGC AGA ACT CAC CTT GGG CCA TTC TCT GCT GAA TAG CCG 

1640 1650 1660 1670 1680

# * * * * 

ACT CCC ACC AGC CAC TCG TAA CAG GAT TAG CAC ACC GAG GTA TCT AGC 
TCA CCC TCG TCG CTG ACC ATT CTC CTA ATC CTC TCG CTC CAT ACA TCC 

1690 1700 1710 1720 1730 

* ♦ * • * 

CGG TCC TAC AGA CTT CTT CAA CTG CTG GCC TAA CTA CCC CTA CAC TAG 
GCC ACC ATC TCT CAA GAA CTT CAC CAC CGG ATT GAT GCC GAT CTG ATC 

1740 1750 1760 1770 

• * * * 

AAG GAC ACT ATT TCG TAT CTC CCC TCT CCT GAA GCC ACT TAC CTT CCG 
TTC CTC TCA TAA ACC ATA GAC CCC AGA CGA CTT CGG TCA ATG GAA CCC 

1780 1790 1800 1810 1820 

* * * * * 

AAA AAG ACT TCG TAC CTC TTC ATC CCG CAA ACA AAC CAC CCC TCC TAG 
TTT TTC TCA ACC ATC GAG AAC TAG GCC CTT TCT TTC GTC GCC ACC ATC 

1830 1840 1850 1860 1870 

* * * ' * * 

CGG TCC TTT TTT TCT TTC CAA CCA GCA GAT TAC CCG CAC AAA AAA ACC 
CCC ACC AAA AAA ACA AAC GTT CCT CCT CTA ATC CCC GTC TTT TTT TCC

1880 1890 1900 1910 1920

* * * * * 

ATC TCA AGA AGA TCC TTT GAT CTT TTC TAC GGG CTC TCA CCC TCA GTC 
TAG AGT TCT TCT AGC AAA CTA GAA AAG ATG CCC CAG ACT CCG ACT CAC 

1930 1940 1950 1960 1970

♦ • * * ♦ 
CAA CCA AAA CTC ACC TTA ACC CAT TTT CCT CAT GAG ATT ATC AAA AAC 
CTT CCT TTT GAG TCC AAT TCC CTA AAA CCA CTA CTC TAA TAC TTT TTC 

1980 1990 2000 2010 

• • ♦ * 

CAT CTT CAC CTA CAT CCT TTT AAA TTA AAA ATC AAC TTT TAA ATC AAT
CTA GAA GTC GAT CTA GCA AAA TTT AAT TTT TAC TTC AAA ATT TAG TTA 

2020 2030 2040 2050 2060 

* * • * * 

CTA AAC TAT ATA TCA CTA AAC TTC CTC TCA CAG TTA CCA ATG CTT AAT 
GAT TTC ATA TAT ACT CAT TTC AAC CAC ACT CTC AAT CGT TAC GAA TTA 
FIG. 5 E



2070 2080 2090 2100 2110 

* * • ♦ ♦ 

CAG TGA CGC ACC TAT CTC ACC GAT CTG TCT ATT TCG TTC ATC CAT ACT 
CTC ACT CCG TCG ATA GAG TCG CTA GAC AGA TAA AGC AAG TAG GTA TCA 

2120 2130 2140 2150 2160 

* * * * * 

TGC CTG ACT CCC CGT CGT GTA GAT AAC TAC GAT ACG GGA GGG CTT ACC 
ACC GAC TGA GGG CCA CCA CAT CTA TTC ATG CTA TGC CCT CCC GAA TCG 

2170 2180 2190 2200 2210

* * * * * 

ATC TCG CCC CAG TGC TGC AAT GAT ACC CCC AGA CCC ACG CTC ACC CGC 
TAC ACC CGC CTC ACG ACC TTA CTA TCG CCC TCT GGG TGC GAG TCG CCC

2220 2230 2240 2250

• * * ♦ 

TCC AGA TTT ATC ACC AAT AAA CCA CCC ACC CCG AAG CCC CCA CCC CAG 
ACG TCT AAA TAG TCG TTA TTT GGT CCC TCC CCC TTC CCC CCT CCC CTC 

2260 2270 2280 2290 2300 

* # * • * 

AAG TCG TCC TGC AAC TTT ATC CCC CTC CAT CCA CTC TAT TAA TTC TTC 
TTC ACC ACG ACC TTC AAA TAG CCG GAC GTA CCT CAC ATA ATT AAC AAC 

2310 2320 2330 2340 2350 

* • • * * 

CCC GGA ACC TAC ACT AAC TAG TTC CCC ACT TAA TAC TTT CCC CAA CCT 
CCC CCT TCC ATC TCA TTC ATC AAG CGC TCA ATT ATC AAA CGC CTT CCA 

2360 2370 2380 2390 2400 

* * * * * 

TCT TCC CAT TCC TAC AGC CAT CCT CGT CTC ACC CTC CTC CTT TCC TAT 
ACA ACC CTA ACC ATC TCC CTA CCA CCA CAC TCC GAG CAC CAA ACC ATA 

2410 2420 2430 2440 2450 

* * * • * 

GCC TTC ATT CAC CTC CGC TTC CCA ACC ATC AAC CCC ACT TAC ATG ATC 
CCC AAG TAA CTC GAC GCC AAG GGT TCC TAC TTC CCC TCA ATC TAC TAC

2460 2470 2480 2490 

♦ • ♦ • 

CCC CAT CTT CTC CAA AAA ACC CCT TAG CTC CTT CCC TCC TCC CAT CCT 
CCC CTA CAA CAC CTT TTT TCC CCA ATC GAC GAA GCC ACG ACG CTA CCA 

2500 2510 2520 2530 2540 

* • • • * 

TCT CAC AAC TAA CTT CGC CCC ACT CTT ATC ACT CAT CCT TAT CCC ACC 
ACA CTC TTC ATT CAA CCC CCC TCA CAA TAG TCA CTA CCA ATA CCC TCC 

2550 2560 2570 2580 2590 

* * * * * 

ACT CCA TAA TTC TCT TAC TCT CAT CCC ATC CGT AAC ATC CTT TTC TCT 
TGA CCT ATT AAC ACA ATC ACA CTA CCG TAC CCA TTC TAC GAA AAG ACA 

2600 2610 2620 2630 2640 

* * * * * 

GAC TCC TCA GTA CTC AAC CAA CTC ATT CTC AGA ATA CTG TAT CCC CCC 
CTC ACC ACT CAT GAC TTC CTT CAC TAA CAC TCT TAT CAC ATA CCC CCC 
FIG. 5 F

2650 2660 2670 2680 2690 

ACC GAG TTG CTC TTG CCC GGC CTC AAC ACC GGA TAA TAG CCC CCC ACA
TGG CTC AAC GAG AAC CGG CCC CAG TTG TCC CCT ATT ATG CCC CCG TGT 

2700 2710 2720 2730



TAG CAG AAC TTT AAA ACT GCT CAT CAT TGG AAA ACC TTC TTC CCG CCG 
ATC GTC TTG AAA TTT TCA CGA GTA GTA ACC TTT TCC AAC AAC CCC CCC

2740 2750 2760 2770 2780 

* * ♦ * + 

AAA ACT CTC AAG GAT CTT ACC CCT CTT GAG ATC CAG TTC CAT GTA ACC 
TTT TGA GAG TTC CTA CAA TGG CGA CAA CTC TAG GTC AAG CTA CAT TCC 

2790 2800 2810 2820 2830 

CAC TCC TCC ACC CAA CTC ATC TTC ACC ATC TTT TAG TTT CAC CAG CCT
GTG AGC ACC TCC CTT CAC TAG AAG TCC TAG AAA ATC AAA CTC CTC CCA 

2840 2850 2860 2870 2880 

# * * * * 

TTC TCG CTC ACC AAA AAC ACC AAC CCA AAA TCC CCC AAA AAA CCC AAT 
AAG ACC CAC TCG TTT TTG TCC TTC CCT TTT ACC CCC TTT TTT CCC TTA

2890 2900 2910 2920 2930

* * • * • 
AAC CCC CAC ACC CAA ATC TTC AAT ACT CAT ACT CTT CCT TTT TCA ATA 
TTC CCC CTC TCC CTT TAC AAC TTA TGA GTA TCA GAA GGA AAA ACT TAT 

2940 2950 2960 2970 

TTA TTC AAG CAT TTA TCA CCG TTA TTC TCT CAT GAG CCC ATA CAT ATT 
AAT AAC TTC CTA AAT ACT CCC AAT AAC ACA CTA CTC CCC TAT GTA TAA 

2980 2990 3000 3010 3020 

• • * * * 

TCA ATC TAT TTA CAA AAA TAA ACA AAT AGG CCT TCC CCC CAC ATT TCC 
ACT TAC ATA AAT CTT TTT ATT TCT TTA TCC CCA ACC CCC CTC TAA ACC 

3030 3040 3050 3060 3070 

* * ♦ * » 

CCC AAA ACT CCC ACC TGA CCT CTA ACA AAC CAT TAT TAT CAT CAC ATT 
CCC TTT TCA CCC TCG ACT CCA CAT TCT TTC CTA ATA ATA GTA CTC TAA 

3080 3090 3100 3110 3120 

* • * • * * 

AAC CTA TAA AAA TAG CCC TAT CAC CAC CCC CTC ATC GCT CTT TGC GCC 
TTC CAT ATT TTT ATC CCC ATA GTG CTC CGG GAC TAC CGA GAA ACC CCG 

3130 3140 3150 3160 3170 

• * * • * 

ACC CAT CCT TCC TAA TCT TCC CTC CCA CCC ACC ACA ACC CTC AAC ACA 
TCG GTA CCA AGC ATT ACA AGG CAC CCT GCC TCC TCT TCC CAG TTC TCT 

3180 3190 3200 3210

* * . * * 

AAA TCT AAT CAC ACT CCC TCA CCT TCC GCT CCC CCT TTC TCC CTT TAT 
TTT ACA TTA CTC TCA CCC ACT GGA AGC CCA CCC GGA AAC ACC CAA ATA
FIG. 5 G

3220 3230 3240 3250 3260 

* * * • # 

AAG GAG ACA CTT TAT GTT TAA GAA GGT TGG TAA ATT CCT TGC GCC TTT
TTC CTC TGT GAA ATA CAA ATT CTT CCA ACC ATT TAA GGA ACC CCG AAA 

3270 3280 3290 3300 3310 

* * • * * 

GGC ACC CAA CCT AGA GAT CCG GCT CTC GAA TGT CTC TCA GTT AGG CTC 
CCG TGG GTT CCA TCT CTA GGC CCA CAC CTT ACA CAC ACT CAA TCC CAC

3320 3330 3340 3350 3360 

* * • * ♦ 

TGG AAA CTC CCC AGG CTC CCC ACC AGG CAG AAG TAT GCA AAG CAT CCA
ACC TTT CAG GGC TCC GAG GGG TCC TCC GTC TTC ATA CCT TTC CTA CGT

3370 3380 3390 3400 3410 

* • # * * 

TCT CAA TTA GTC AGC AAC CAG GCT CCC CAG CAG GCA GAA CTA TCC AAA 
AGA GTT AAT CAG TCC TTG CTC CCA GGG GTC GTC CCT CTT CAT ACC TTT 

3420 3430 3440 3450

• * ♦ * 

CCA TCC ATC TCA ATT ACT CAG CAA CCA TAG TCC CCC CCC TAA CTC CCC 
CCT ACC TAG ACT TAA TCA GTC GTT GGT ATC AGG GCG GGG ATT GAG CCG 

3460 3470 3480 3490 3500 

* * * ♦ ♦ 

CCA TCC CCC CCC TAA CTC CCC CCA CTT CCC CCC ATT CTC CCC CCC ATC 

GGT AGG CCG GGG ATT GAG CCG GGT CAA GCC GGC TAA GAG GCC GGG TAC 

3510 3520 3530 3540 3550 

* ■ * * 

GCT CAC TAA TTT TTT TTA TTT ATC CAG AGC CCC AGC CCG CCT CCG CCT 
CGA CTC ATT AAA AAA AAT AAA TAC GTC TCC GGC TCC CCC GGA GCC GGA 

3560 3570 3580 3590 3600 

CTC AGC TAT TCC AGA ACT ACT GAG GAG CCT TTT TTC GAG GCC TAG CCT 
CAC TCC ATA AGG TCT TCA TCA CTC CTC CCA AAA AAC CTC CCG ATC CGA 

3610 3620 3630 3640 3650 

* 

TTT CCA AAA ACC TAG CTT GGC GCC ACC CCT CAG AGC ACC TTC CAC CAT 
AAA CCT TTT TCC ATC CAA CCC CCG TGG CGA CTC TCC TGG AAG GTC CTA 

3660 3670 3680 3690

♦ * * 

GCC CAC CTC AGC AAG TTC CCA CTT GAA CAA AAA CAT CAA GCA AAT CTA 
CCC GTC GAG TCC TTC AAG GGT GAA CTT CTT TTT CTA GTT CCT TTA CAT 

3700 3710 3720 3730 3740 

* • • ♦ * 

CTT GTG CCT CCC CCA GCG TCA CAA ACT CCA AGC CAT CTA TAT CTC GGT 
GAA CAC CGA CCC CGT CCC ACT CTT TCA CCT TCC GTA CAT ATA GAC CCA 

3750 3760 3770 3780 3790

*



TCA TGG TAC TCC AGA AGG ACT GCG CTC CAA AAC CCC CAC CCT GGA CTG 
ACT ACC ATC ACC TCT TCC TCA CCC GAC GTT TTG GGC GTC GGA CCT GAC 
FIG. 5 H



3800 3810 3820 3830 3840

* • * * * 

TGA GCC CAA GTC TGT AGA AGA GTT ACC TGA GTC GAA TTT TGA TCC CTC
ACT CGG GTT CAC ACA TCT TCT CAA TCC ACT CAC CTT AAA ACT ACC GAG 

3850 3860 3870 3880 3890

* * * * * 

TAG TAC CTT TCA GTC TGA GGG CTC CAA CAG TGA CAT GTA TCT CAC CTC 
ATC ATG GAA ACT CAG ACT CCC GAG GTT GTC ACT GTA CAT AGA GTC GGG 

3900 3910 3920 3930 

* * • * 

TGT TCC CAT CTT TCC GGA CCC CTT CCC CAG AGA TCC CAA CAA GCT CCT 
ACA ACC GTA CAA ACC CCT GGG GAA GGC CTC TCT ACG GTT GTT CCA CCA 

3940 3950 3960 3970 3980 

• * * * * 

GTT CTC TGA ACT TTT CAA GTA CAA CCG GAA GCC TCC ACA CAC CAA TTT
CAA GAC ACT TCA AAA GTT CAT GTT CCC CTT CCG ACC TCT CTC GTT AAA 

3990 4000 4010 4020 4030 

* * * * * 

AAG CCA CTC GTC TAA ACC GAT AAT GGA CAT GCT GAG CAA CCA CCA CCC 
TTC GCT GAG CAC ATT TCC CTA TTA CCT GTA CCA CTC GTT GCT CCT GGC

4040 4050 4060 4070 4080 

* * * * * 

CTC CTT TCC AAT GGA ACA GGA GTA TAC TCT GAT CCC AAC AGA TCC CCA 
CAC CAA ACC TTA CCT TGT CCT CAT ATC AGA CTA CCC TTC TCT ACC CCT 

4090 4100 4110 4120 4130 

* * ♦ * • 

CCC TTT TCC TTC CCC TTC CAA TCC CTT TCC TCC CCC CCA ACG TCC CTA 

GGG AAA ACC AAC CCG AAG CTT ACC CAA AGG ACC CCC GGT TCC ACC CAT 

4140 4150 4160 4170 

• * • ♦ 

TTA CTC TGC TCT CCC CCC AGA CAA AGC CTA TCG CAC GGA TAT CCT GGA 
AAT GAC ACC ACA CCC CCC TCT CTT TCC GAT ACC GTC CCT ATA GCA CCT 

4180 4190 4200 4210 4220 

• ♦ • # • 

CCC TCA CTA CCC CCC CTC CTT GTA TCC TCC CCT CAA GAT TAC ACG AAC 
CCG ACT GAT CCC CCC CAC CAA CAT ACC ACC CCA GTT CTA ATC TCC TTC 

4230 4240 4250 4260 4270 

* • • * • 

AAA TCC TCA GGT CAT CCC TCC CCA CTC GGA ACT CCA AAT AGC ACC CTC 
TTT ACC ACT CCA GTA CCC ACC GGT CAC CCT TGA GGT TTA TCC TCC CAC 

4280 4290 4300 4310 4320 

TGA AGG AAT CCG CAT GGG AGA TCA TCT CTC CCT CCC CCC TTT CAT CTT 
ACT TCC TTA CCC CTA CCC TCT ACT AGA GAC CCA CCC GGC AAA GTA CAA 

4330 4340 4350 4360 4370 

* * * * * 

NCA TCG ACT ATG TGA AGA CTT TCC GCT AAT AGC AAC CTT TGA CCC CAA 
HGT AGC TCA TAC ACT TCT GAA ACC CCA TTA TCG TTC GAA ACT CCG CTT


FIG. 5 I 

4380 4390 4400 4410

* *



CCC CAT TCC TCC GAA CTG GAA TCG TGC AGG CTC CCA TAC CAA CTT TAG
CCG CTA AGG ACC CTT GAC CTT ACC ACG TCC GAC GGT ATG GTT GAA ATC 

4420 4430 4440 4450 4460

* * * * " 

^^^ aT ^^^^raTCTCAAccACAT^c^ 

CTC CTT CCC CTA CCC CCT CCT CTT ACC AGA CTT OCT CTA GCT CCT OK 

4470 4480 4490 4500 4510

™^^^^^^^ OT ^^<^T03AGC™ 
CTA CCT CTT TCA TTC GTT CCC CCT GGC CAT CCT CTA ACC TCG GAT OCT 

4520 4530 4540 4550 4560

TCCCAACMCCCCCTCGACAATCCCC»TTCTCTG3^ 

4570 4580 4590 4600 4610

™^^Sr^^ OTTO ^^ TOTO ^To:c^T^c 

TTC CAG CTT GTA CTT CCT GAA AAG ACC ACC ACA CCC CTT ACC CTC ACC 

4620 4630 4640 4650 

• * * 

CAG CAT CCC CAT TCC CCC GAC TGT CCC CCA CGA GAA GAA AGG TTA CTT
CTC CTA CCC CTA ACG GGC CTG ACA CCC CCT CCT CTT CTT TCC AAT GAA 

4660 4670 4680 4690 4700 

# 

TGA AGA CCC CGG CCC CTC TGC CAA TTC TCA CCC CTT TGC ACT GAC AGA 
ACT TCT GGC GCC CCC GAG ACC CTT AAC ACT CGG GAA ACC TCA CTC TCT 

4710 4720 4730 4740 4750 

AGC CAT CCT CCG CAC ATC CCT TCT CAA TGA GAC TCG CCA CGA GCC CTT 
TCG CTA CCA GCC CTG TAC CGA AGA CTT ACT CTG ACC CGT CCT CGG GAA 



4760 4770 4780 4790 4800



CCA ATA CAA AAA CTA ATT AGA CTT TGA CTG ATC TTG ACC CTT TCC TAG 
CCT TAT CTT TTT GAT TAA TCT GAA ACT CAC TAG AAC TCC GAA AGG ATC 



4810 4820 4830 4840 4850

*



TTC ATC CCA CCC CCC CCC AGA GAG ATC TTT CTC AAC CAA CCT TAC TTC 
AAG TAG CGT GCC CCC CCG TCT CTC TAG AAA CAC TTC CTT CGA ATC AAG 

4860 4870 4880 4890

♦ ♦ ♦ 

TCT CCT CTC ACA TAA TTC GAC AAA CTA CCT ACA GAC ATT TAA AGC TCT 
ACA CCA CAC TCT ATT AAC CTC TTT GAT GGA TCT CTC TAA ATT TCC AGA 



4900 4910 4920 4930 4940



AAG GTA AAT ATA AAA TTT TTA ACT CTA TAA TCT CTT AAA CTA CTC ATT 
TTC CAT TTA TAT TTT AAA AAT TCA CAT ATT ACA CAA TTT GAT GAC TAA
FIG. 5 J

4950 4960 4970 4980 4990

* * * * * 

CTA ATT GTT TGT GTA TTT TAG ATT CCA ACC TAT GGA ACT GAT GAA TCC 
GAT TAA CAA ACA CAT AAA ATC TAA GCT TGG ATA CCT TGA CTA CTT ACC 

5000 5010 5020 5030 5040 

* * * * * 

GAG CAG TGG TGG AAT GCC TTT AAT GAG GAA AAC CTG TTT TCC TCA GAA 
CTC CTC ACC ACC TTA CGC AAA TTA CTC CTT TTG GAC AAA ACC ACT CTT 

5050 5060 5070 5080 5090 

* * * • » 

CAA ATG CCA TCT ACT GAT CAT GAG CCT ACT CCT GAC TCT CAA CAT TCT 
CTT TAG GCT ACA TCA CTA CTA CTC CGA TGA CCA CTG ACA GTT GTA ACA 

5100 5110 5120 5130 

* • * * 

ACT CCT CCA AAA AAG AAG AGA AAG GTA GAA GAC CCC AAG GAC TTT CCT 
TCA GGA GGT TTT TTC TTC TCT TTC CAT CTT CTG GGC TTC CTG AAA GGA 

5140 5150 5160 5170 5180 

* # * * * 

TCA GAA TTG CTA ACT TTT TTG ACT CAT CCT GTG TTT ACT AAT AGA ACT 
ACT CTT AAC GAT TCA AAA AAC TCA CTA CGA CAC AAA TCA TTA TCT TGA

5190 5200 5210 5220 5230 

♦ • * • * 

CTT CCT TCC TTT CCT ATT TAG ACC ACA AAG GAA AAA GCT CCA CTC CTA
GAA CCA ACC AAA CGA TAA ATC TGG TGT TTC CTT TTT CGA CCT GAC GAT 

5240 5250 5260 5270 5280

* * * * - • 
TAC AAG AAA ATT ATG GAA AAA TAT TCT CTA ACC TTT ATA ACT AGG CAT 
ATC TTC TTT TAA TAC CTT TTT ATA AGA CAT TCC AAA TAT TCA TCC CTA

5290 5300 5310 5320 5330 

* * • • * 

AAC ACT TAT AAT CAT AAC ATA CTC TTT TTT CTT ACT CCA CAC AGG CAT 
TTC TCA ATA TTA GTA TTG TAT GAC AAA AAA GAA TGA GGT CTC TCC CTA 

5340 5350 5360 5370 

* * • • 

AGA GTG TCT CCT ATT AAT AAC TAT CCT CAA AAA TTG TCT ACC TTT ACC 
TCT CAC AGA CGA TAA TTA TTG ATA CGA GTT TTT AAC ACA TGG AAA TCC 

5380 5390 5400 5410 5420 

• ♦ * * * 

TTT TTA ATT TCT AAA GGC CTT AAT AAC GAA TAT TTG ATC TAT ACT GCC 
AAA AAT TAA ACA TTT CCC CAA TTA TTC CTT ATA AAC TAC ATA TCA CCC 

5430 5440 5450 5460 5470 

• * • * * 

TTC ACT ACA GAT CAT AAT CAG CCA TAC CAC ATT TCT AGA GCT TTT ACT 
AAC TCA TCT CTA GTA TTA CTC CCT ATC CTG TAA ACA TCT CCA AAA TGA 

5480 5490 5500 5510 5520 

* * * * « 
TCC TTT AAA AAA CCT CCC ACA CCT CCC CCT GAA CCT GAA ACA TAA AAT 
ACC AAA TTT TTT GGA GGG TCT GGA GGC GGA CTT GGA CTT TCT ATT TTA
FIG. 5 K



5530 5540 5550 5560 5570 

* * * # « 

GAA TGC AAT TGT TCT TCT TAA CTT GTT TAT TGC AGC TTA TAA TGG TTA 
CTT ACG TTA ACA ACA ACA ATT GAA CAA ATA ACC TCC AAT ATT ACC AAT 

5580 5590 5600 5610 

* * * # 

CAA ATA AAG CAA TAG CAT CAC AAA TTT CAC AAA TAA AGC ATT TTT TTC
GTT TAT TTC GTT ATC GTA GTG TTT AAA GTG TTT ATT TCC TAA AAA AAG

5620 5630 5640 5650 5660 

* * • * ♦ 

ACT GCA TTC TAG TTC TGG TTT GTC CAA ACT CAT CAA TCT ATC TTA TCA 
TCA CCT AAG ATC AAC ACC AAA CAG GTT TGA GTA GTT ACA TAG AAT ACT 

5670 5680 5690 5700 5710 

• * ♦ • » 

TCT CTC GAT CTC TAG CTT CCT GTC AAG CAC GGT CAC TCC ACT GAA TAA 
ACA GAC CTA GAG ATC CAA CCA CAG TTC CTC CCA CTG ACC TCA CTT ATT 

5720 5730 5740 5750 5760 

* * * * # 

TAA AAT GTC TCT TTC TCC CAA ATA CGC CTT TTC AGA TTT CTC TCC CTC
ATT TTA CAC ACA AAC AGC CTT TAT GCG CAA AAC TCT AAA GAC AGC CGC 

5770 5780 5790 5800 5810 

* • ♦ # # 
ACT AAA TTC ATC TCC CGC CAT ACT GGT CTT TAT CCC CCA TAG ACA TGC 
TCA TTT AAG TAC AGC GCG CTA TCA CCA CAA ATA CCC CCT ATC TCT ACC 

5820 5830 5840 5850

♦ • • • 

CCA TAT TCC AAA AAT CCA TAT TTC AAA ATA TGG CAT ATT CAA AAT GTC 
CCT ATA ACC TTT TTA CCT ATA AAC TTT TAT ACC GTA TAA CTT TTA CAC 

5860 5870 5880 5890 5900

* * * • * 

CCC GAT CTC ACT TTC TCT GTA ACT CAT ATC CCC ATT TTT CCA AAA CTC 
CGC CTA CAC TCA AAG ACA CAT TCA CTA TAG CCG TAA AAA GGT TTT CAC

5910 5920 5930 5940 5950 

• * * * # 

ATT TTT CGC CAT ACG CCA TAT CTC GCG ATA CCG CTT ATA TCC TTT ACC 
TAA AAA CCC GTA TCC GCT ATA GAC CCC TAT CCC CAA TAT AGC AAA TGC 

5960 5970 5980 5990 6000 

* • * * * 

CCG CAT CGC GAT ACA CGA CTT TCC TCA CTT CCC CCA TTC TCT CTC TCC 
CCC CTA CCC CTA TCT CCT CAA ACC ACT CAA CCC CCT AAG ACA CAC AGC 

6010 6020 6030 6040 6050 

* • ♦ • * 

CAA ATA TCC CAC TTT CCA TAT AGG TCA CAG ACG ATA TGA GCC TAT ATC 
CTT TAT AGC CTC AAA CCT ATA TCC ACT CTC TCC TAT ACT CCC ATA TAG 

6060 6070 6080 6090 

• • • ♦ 

CCC GAT ACA CCC CAC ATC AAG CTC CCA CAT GCC CAA TGC ATA TCC ATC 
CCC CTA TCT CCC CTC TAG TTC GAC CCT CTA CCG GTT ACC TAT ACC TAC 
FIG. 5 L

6100 6110 6120 6130 6140



* 



TAT ACA TTC AAT CAA TAT TCG CCA TTA GCC ATA TTA TTC ATT GCT TAT
ATA TGT AAC TTA GTT ATA ACC GGT AAT CGG TAT AAT AAC TAA CCA ATA

6150 6160 6170 6180 6190 

ATA GCA TAA ATC AAT ATT GCC TAT TCG CCA TTC CAT ACC TTG TAT CCA 
TAT CCT ATT TAG TTA TAA CGG ATA ACC GGT AAC GTA TCC AAC ATA GGT

6200 6210 6220 6230 6240 

TAT CAT AAT ATC TAC ATT TAT ATT GCC TCA TCT CCA ACA TTA CCG CCA
ATA GTA TTA TAC ATC TAA ATA TAA CCG ACT ACA GGT TCT AAT GCC CCT 

6250 6260 6270 6280 6290 

TCT TGA CAT TGA TTA TTG ACT ACT TAT TAA TAG TAA TCA ATT ACC CCG 
ACA ACT GTA ACT AAT AAC TGA TCA ATA ATT ATC ATT ACT TAA TCC CCC 

6300 6310 6320 6330

TCA TTA GTT CAT ACC CCA TAT ATC GAG TTC CCC GTT ACA TAA CTT ACG
ACT AAT CAA GTA TCC CGT ATA TAC CTC AAC CCG CAA TCT ATT GAA TCC 



6340 6350 6360 6370 6380



CTAAATCGCCCCOTCGCTCACCGCCCAACGACCCCCGCC^ 

CAT TTA CCC GCC CCA OXACTCCCGCGTrCCTCCCGCCCCCTAACTCC 

6390 6400 6410 6420 6430 

• * * * * 

TCA ATA ATC ACC TAT CTT CCC ATA GTA ACC CCA ATA CCC ACT TTC CAT 
ACT TAT TAC TCC ATA CAA GCC TAT CAT TCC CCT TAT CCC TCA AAC GTA 

6440 6450 6460 6470 6480 

* * * ♦ * 

TCA CCT CAA TCG CTC GAG TAT TTA CCG TAA ACT CCC CAC TTC CCA GTA 
ACT CCA CTT ACC CAC CTC ATA AAT CCC ATT TCA CCG CTC AAC CCT CAT 

6490 6500 6510 6520 6530 

• * • • * 

CAT CAA CTC TAT CAT ATC CCA ACT ACC CCC CCT ATT CAC CTC AAT CAC 
GTA GTT CAC ATA GTA TAC CGT TCA TCC CCC GCA TAA CTC CAC TTA CTC 

6540 6550 6560 6570 

• * * * 

GGT AAA TCC CCC CCC TCC CAT TAT GCC CAC TAC ATC ACC TTA TCC CAC 
CCA TTT ACC CCC CCC ACC GTA ATA CCG CTC ATC TAC TCC AAT ACC CTC

6580 6590 6600 6610 6620 

• * • * • 

TTT CCT ACT TCC CAC TAC ATC TAC GTA TTA CTC ATC CCT ATT ACC ATC 
AAA CCA TCA ACC CTC ATC TAC ATC CAT AAT CAC TAG CCA TAA TCG TAC

6630 6640 6650 6660 6670 

• ♦ • * * 

CTC ATC CCC TTT TCC CAC TAC ATC AAT CCG CCT GCA TAC CCC TTT CAC 
CAC TAC CCC AAA ACC CTC ATC TAG TTA CCC GCA CCT ATC CCC AAA CTC 
FIG. 5 M

6680 6690 6700 6710 6720 

* ■ * 

TCA CGG GGA TTT CCA ACT CTC CAC CCC ATT GAC CTC AAT GCG ACT TTC
ACT CCC CCT AAA CCT TCA GAG CTG CGG TAA CTC CAG TTA CCC TCA AAC



6730 6740 6750 6760 6770



♦ • ♦ 



TTT TCG CAC CAA AAT CAA CGG GAC TTT CCA AAA TGT CCT AAC AAC TCC
AAA ACC GTG CTT TTA GTT CCC CTG AAA GGT TTT ACA CCA TTG TTC ACC

6780 6790 6800 6810

* * * ♦ 

GCC CCA TTG ACG CAA ATG GCC CCT ACC CCT GTA CCC TCG CAG GTC TAT 
CCG CCT AAC TCC GTT TAC CCG CCA TCC CCA CAT CCC ACC CTC CAG ATA 

6820 6830 6840 6850 6860 

♦ ♦ * # 

ATA ACC ACA GCT CCT TTA CTC AAC CCT CAG ATC CCC TCG ACA CCC CAT 
TAT TCC TCT CCA CCA AAT CAC TTG CCA GTC TAG CGG ACC TCT CCG GTA 

6870 6880 6890 6900 6910 

CCA CCC TCT TTT GAC CTC CAT ACA ACA CAC CCG GAC CCA TCC ACC CTC 
GCT CCG ACA AAA CTC CAG GTA TCT TCT GTC CCC CTG GCT AGG TCC GAG 

6920 6930 6940 6950 6960

CCCCGCC^CAACCCTGCATrCCAACGCCGATrCCCOTCCXAAC^ 
GCC CCG GCC CTT CCC ACC TAA CCT TCC CCC TAA GCC CCA CCC TTC TCA 

6970 6980 6990 7000 7010

GAC GTA ACT ACC CCC TAT ACA GTC TAT AGG CCC ACC CCC TTG GCT TCT 
CTG CAT TCA TCG CCC ATA TCT CAG ATA TCC CCG TCG GCC AAC CCA AGA 

7020 7030 7040 7050 

* • * ♦ 

TAT CCA TCC TAT ACT CTT TTT CCC TTC CCC TCT ATA CAC CCC CCC TTC 
ATA CCT ACC ATA TCA CAA AAA CCC AAC CCC AGA TAT GTG GGG CCC AAG 

7060 7070 7080 7090 7100

CTC ATC TTA TAG CTC ATC CTA TAG CTT ACC CTA TAG CTG TCG GTT ATT 
CAG TAC AAT ATC CAC TAC CAT ATC CAA TCG GAT ATC CAC ACC CAA TAA 

7110 7120 7130 7140 7150 

GAC CAT TAT TCA CCA CTC CCC TAT TCC TCA CCA TAC TTT CCA TTA CTA 
CTC CTA ATA ACT GCT GAC GCC ATA ACC ACT CCT ATC AAA GCT AAT GAT 

7160 7170 7180 7190 7200 

♦ * 

ATC CAT AAC ATC CCT CTT TCC CAC AAC TCT CTT TAT TCC CTA TAT GCC 
TAC CTA TTC TAC CCA CAA ACC CTC TTC ACA CAA ATA ACC CAT ATA CCG 

7210 7220 7230 7240 7250 

AAT ACA CTC TCC TTC AGA GAC TCA CAC GGA CTC TGT ATT TTT ACA CCA
TTA TCT GAC AGG AAG TCT CTG ACT GTC CCT GAG ACA TAA AAA TGT CCT 
FIG. 5 N

7260 7270 7280 7290

* 

TGG GGT CTC ATT TAT TAT TTA CAA ATT CAC ATA TAC AAC APT 1 xrr r~rr> 
ACC CCA GAG TAA ATA ATA AAT CTT TAA CTG TAT ATG TTC TOG TOG CAG



7300 7310 7320 7330 7340



CCC ACT CCC CCC ACT TTT TAT TAA ACA TAA OCT GGG ATC TCC ACC «•» 
GGG TCA CGG GCG TCA AAA ATA ATT TGT ATT CCA CCC TAG AGS TCC GCT 

7350 7360 7370 7380 7390

• * * • 

ATC TCG GOT ACC TGT TCC GGA CAT GGG CTC TTC TCC GGT AGC car «•» 

tagagcc^tccacaaggotctaccccagaacaSStSSot 



7400 7410 7420 7430 7440



ccaagatctaggctc^acgacggtacgcaggtccctoactaSaS 

7450 7460 7470 7480 7490

CTC CCC AGC TCC TTC CTC CIA ACA CTO GAG CCC ACA CTT AGS rxr »m» 

<^ccctccaccaaccaccattctcacc^<^t1£ 

7500 7510 7520 7530 

* * • • 

ACG ATC CCC ACC ACC ACCMTCTBCCBOCAAecCCoreepr™ 

TCCTACCGGTCGTCGTGCTCAC^CGCCTCTOcSScSc^ 
7550 7560 7570 7580

TAT CTC TCT CAA AAT CAC CTC CCC CAC CCC OCT TOP »rr rJ^» 
ATACACACAOTTTACTCCACCXSrc^GSS 

7590 7600 7610 7620 7630 

• • ♦ * * 

TTT GGA ACA CTT AAC CCA CCC CCA CAA CAA GAT CCA CCC ACC TCA CTT 
AAA CCT TCT CAA TTC CCT CCC GCT CTT CTT CTA CCT CCC TCC ACT CAA

7640 7650 7660 7670 7680

♦ * * * # 

CTT CTC TTC TCA TAA CAC TCA CAC CTA ACT CCC CTT CCC CTC CTC TTA 
CAA CAC AAC ACT ATT CTC ACT CTC CAT TCA CCC CAA CCC CAC CAC AAT 

7690 7700 7710 7720 7730 

* * • • • • 

ACC CTC CAC CCC ACT CTA CTC TCA CCA CTA CTC CTT CCT CCC CCC CCC 
TCC CAC CTC CCC TCA CAT CAC ACT CCT CAT CAC CAA CCA CCC CCC CCC 

7740 7750 7760 7770 

* • • • 

CCC ACC ACA CAT AAT ACC TCA CAC ACT AAC ACA CTC TTC CTT TCC ATC 
CCC TCG TCT CTA TTA TCC ACT CTC TCA TTC TCT CAC AAC GAA ACG TAC 

7780 7790 7800 7810 7820 

• • * * ♦ 

CCT CTT TTC TCC ACT CAC CCT CCT TCA CAC GAA CCT TCG CCT CCA CCT 
CCA CAA AAC ACC TCA CTC CCA CCA ACT CTC CTT CCA ACC CCA CCT CCA 
FIG. 5 O



7830 7840 7850 7860

* • * * 

CCA TCC ACT CTA GAG GAT CCA TCC CCC CCC GAG CTC G

CCT AGC TGA GAT CTC CTA GCT AGG GGC CCG CTC GAG C 